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1.  Introduction  and  summary 

In  this  thesis  we  will  deal  with  selection  procedure 
problems  involving  multivariate  normal  populations.  These 
problems  are  of  two  distinct  types:  The  first  involves  k 
bivariate  normal  populations,  wl'ere  the  goal  is  to  select  the 
"best"  population.  The  second  involves  one  (k+1) -variate  nor¬ 
mal  population,  where  the  goal  is  to  select  the  "best"  set  of 
a  preassigned  number  of  variates.  Our  major  interest  will 
center  on  problems  of  prediction  and  hence  the  "best"  popula¬ 
tion  or  "best"  set  of  /ariates  will  be  determined  on  the  basis 
of  measures  of  "goodness"  such  as  the  simple  population  corre¬ 
lation  coefficient  and  the  multiple  population  correlation 
coefficient.  We  will  limit  our  consideration  to  single-stage 
procedures . 

The  problem  of  selecting  a  variate  or  set  of  variates  for 
prediction  of  a  designated  variate  occurs  in  many  areas. 
Hotelling  [34]  formulated  such  a  problem  as  a  hypothesis  test¬ 
ing  problem.  We  formulate  this  problem  and  similar  but  more 
general  ones  as  multiple-decision  problems.  Our  major  objec¬ 
tive  is  to  provide  a  rational  basis  for  determining  the  scunple 
size  necessary  to  insure  that  the  probeibility  of  correctly 
selecting  the  "best"  population  (or  set  of  variates)  is  suffi¬ 
ciently  high  whenever  the  "best"  population  (or  set  of  vari¬ 
ates)  is  better  than  the  "next  best"  population  (or  set  of 
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variatas)  by  at  laaat  a  pradatarmlnad  amount. 

1 . 1  The  literature 

A  considerable  literature  or  selection  procedures  already 
exists.  This  discussion  is  not  intended  to  be  a  complete  re¬ 
view  of  the  literature  of  multiple-decision  selection  proce¬ 
dures,  but  rather  to  provide  the  reader  with  some  idea  of  the 
general  nature  of  the  previous  work  which  is  relevant  to  this 
thesis . 

Although  there  are  many  ways  in  which  selection  proce¬ 
dures  can  be  formulated,  the  two  most  common  formulations  in 
the  literature  are  the  "indifference  zone"  approach  as  pro¬ 
posed  by  Bechhofer  [S]  and  the  "subset"  approach  as  proposed 
by  Gupta  [27]. 

Bechhofer  [5]  formulated  and  solved  the  "indifference 
zone"  approach  for  the  problem  of  ranking  the  means  of  k  uni¬ 
variate  normal  populations  with  common  known  variances,  em¬ 
ploying  a  single-stage  procedure.  Bechhofer,  Kiefer,  and 
Sobel  [9]  have  written  a  monograph,  Sequential  Identification 
and  Ranking  Procedures ,  in  which  they  discuss  sequential  solu¬ 
tions  to  the  problems  of  ranking  parameters  of  Koopman-Darmois 
populations  (the  normal  means  problem  thus  being  considered  as 
a  special  case) .  This  monograph  also  contains  a  very  complete 
bibliography  on  multiple-decision  selection  and  ranking 
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procedures  (including  both  the  "indifference  zone"  and  the 
"subset"  approach) .  Paulson  (42]  has  proposed  a  sequential 
procedure  for  the  normal  means  problem  which  is  quite  differ- 
ent  from  that  described  in  [9]. 

Bechhofer,  Dunnett,  and  Sobel  [7]  gave  a  two-stage  proce¬ 
dure  for  the  normal  means  problem  when  the  common  variance  is 
unknown  and  more  recently  Robbins,  Sobel  and  Starr  [46]  have 
given  a  sequential  solution  to  this  problem.  Several  other 
papers  have  considered  the  "indifference  zone"  formulation  for 
other  univariate  distributions  and/or  other  parameters.  The 
paper  on  ranking  variances  of  univariate  normal  populations  by 
Bechhofer  and  Sobel  [10]  is  of  particular  relevance  to  this 
thesis. 

A  similar  development  has  occurred  for  the  "subset"  for¬ 
mulation  of  the  ranking  problem.  We  will  be  concerned  only 
with  the  "indifference  zone"  approach  in  this  thesis  and  refer 
the  reader  to  Gupta  [27]  for  a  description  of  the  "subset" 
approach . 

While  most  of  the  previous  work  has  dealt  with  univariate 
populations,  some  recent  papers  (Alam  and  Rizvi  [1], 

"f 

Gnanadesikan  [23],  Gupta  [28],  Gupta  and  Panchapekesan  [29], 

^During  the  period  in  which  this  thesis  was  being  written 
in  final  form,  this  paper  was  delivered  at  the  Second  Interna¬ 
tional  Symposium  on  Multivariate  Analysis.  The  problem  that 
we  consider  in  Section  2.3  is  a  special  case  of  the  one 
described  in  this  paper. 
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Gupta  and  Studden  [30]  ,  Krishnaiah  [36] ,  Krishnaiah  and  Rlzvi 
[37],  and  Thornby  [49]),  using  a  v_riety  of  approaches,  have 
considered  single-stage  procedures  for  ranking  problems  in¬ 
volving  k  p-variate  (k  >  2,  p  ^  2)  normal  populations.  Here 
unlike  the  problems  involving  univariate  populations,  the 
vectors  or  matrices  of  parameters  do  not  have  a  single  "natu¬ 
ral"  ranking,  but  can  be  ranked  according  to  many  different 
criteria.  All  of  these  papers  consider  problems  wherein  the 
"goodness"  of  a  population  is  measured  in  terms  of  predeter¬ 
mined  univariate  functions  of  its  parameters.  The  problems 
considered  in  these  papers  can  be  categorized  on  the  basis  of 
these  functions.  One  category  consists  of  those  problems  in 
which  this  function  is  the  Mahalanobis  distance  where 

and  are  the  population  mean  vector  and  the  population 
covariance  matrix,  respectively,  of  the  i^^  p-variate  popula¬ 
tion,  and  includes  problems  considered  in  [1],  [28],  [30],  and 
[37].  Another  category  consists  of  those  probleias  in  which 
this  function  depends  only  on  the  elements  of  the  and  in¬ 
cludes  problems  considered  in  [23],  [37],  and  [49].  A  third 

category  consists  of  those  problems  in  which  this  function  de¬ 
pends  only  on  the  elements  of  the  and  includes  problems 

considered  by  [23],  [29],  and  [36].  The  problems  considered 

in  Part  I  of  this  thesis  fall  into  this  tnird  category. 

The  problems  of  all  three  of  these  categories  appear  to 
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be  rather  artificial  and  are  mainly  of  theoretical  interest. 

In  this  thesis  onr  principal  reason  for  considering  problems 
of  this  nature  is  that  they  provide  some  insight  into  the 
solutions  of  problems  which  do  have  a  practical  significance. 
These  latter  problems  which  are  associated  with  a  single 
multivariate  normal  population  are  considered  in  Part  II  of 
this  thesis. 

Several  papers  have  also  been  written  on  another  class  of 
problems — ranking  the  variates  of  a  single  p-variate  popula¬ 
tion.  There  are  numerous  possible  "natural"  rankings  of  the 
variates  for  this  class  of  problems,  including  the  "natural" 
ranking  for  the  corresponding  problem  involving  k  univariate 
populations.  These  problems  are  complicated  by  the  correla¬ 
tion  structure  (which  may  be  unknown)  between  these  variates, 
since  most  of  the  general  theorems  concerning  ranking  (e.g  , 
see  Barr  and  Rizvi  (3])  assume  the  variates  to  be  independent. 

Bechhofer,  Elmaghraby  and  .Morse  (8)  have  given  a  single- 
stage  solution  for  the  problem  of  selecting  the  variate  with 
the  largest  single-trial  cell  probability  for  a  single  multi¬ 
nomial  population;  a  sequential  solution  has  also  been  given 
by  Bechhofer,  Kiefer,  and  Sobel  19]. 

Gnanadesikan  [23]  has  given  a  single-stage  solution  for 
the  "subset"  approach  to  the  problem  of  selecting  the  variate 
with  the  largest  standardized  population  mean  for  both  the 
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ca«a  of  a  known  and  an  unknown  oovarlanea  matrix.  Bechhofer, 

Kiefer  and  Sobel  (9]  have  shown  that  their  sequential  solution 

for  Koopman-Darmois  populations  holds  for  the  problem  of 

selecting  the  variate  with  the  largest  population  mean  from  a 

single  multivariate  normal  population  v/ith  a  known  covariance 

matrix  of  the  form  °  and  also  for  the 

^ J  ■‘•J  t  po  iftj  J 

problem  of  selecting  the  variate  with  the  largest  population 
variance  from  a  single  bivariate  normal  population  with  known 
or  unknown  population  means  and  known  population  correlation 
coefficient.  The  problems  considered  in  Part  II  fall  within 
this  general  class  of  problems,  which  involve  ranking  the  var¬ 
iates  of  a  single  multivariate  population. 

1 . 2  Summary 

This  thesis  consists  of  two  distinct,  but  related  parts. 
In  both  parts  our  ultimate  objective  will  be  to  provide  a 
rational  basis  for  determining  the  sample  size  for  an  experi¬ 
ment  in  which  the  goal  is  to  select  the  "best"  population  (or 
set  of  variates)  when  certain  probcibility  requirements  (which 
will  be  described  precisely  in  J ater  sections)  are  to  be 
guaranteed.  The  formulation  that  we  adopt  here  falls  within 
the  framework  of  the  "indifference  zone"  ranking  approach  as 
proposed  by  Bechhofer.  (5J.  Many  of  the  ideas  considered  in 
this  thesis  could  also  be  carried  over  to  the  "subset" 
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formulation  of  Gupta  (27)  ,  but  we  do  not  consider  his  approach 
here. 

In  order  to  determine  the  exact  sample  size  required  for 
the  "indifference  zone"  approach,  one  must  know  the  exact 
joint  distribution  of  the  statistics  on  which  the  decision 
procedure  is  based.  For  many  of  the  problems  which  we  consid¬ 
er  in  this  thesis,  in  particular  those  in  Part  II,  the  deter¬ 
mination  of  these  exact  distributions  appears  almost  hopele.s. 
Hence  we  will  find  the  asymptotic  joint  distribution  of  these 
statistics  and  work  with  it  in  the  scime  manner  as  one  would 
with  the  exact  joint  distribution  were  it  known.  That  is,  we 
will  obtain  the  infimum  of  the  asymptotic  probability  of  cor¬ 
rect  selection  over  the  region  of  preference  for  a  correct 
selection  and  choose  our  sample  size  in  such  a  way  that  this 
infimum  satisfies  the  probability  requirement.  For  certain  of 
these  cases  the  exact  infimum  of  the  asymptotic  probability, 
referred  to  above,  is  also  difficult  to  obtain,  and  in  these 
cases  we  find  a  lower  bound  to  this  asymptotic  probcibility  and 
obtain  the  infimum  of  this  lower  bound.  Using  these  results, 
we  find  a  conservative  approximation  to  the  asymptotic  seunple 
size  necessary  to  satisfy  the  probability  requirement. 

In  Part  I  we  consider  problems  concerning  the  selection 
of  a  "best"  population  from  a  set  of  k  independent  bivariate 
normal  populations.  We  will  be  interested  in  attempting  to 
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predict  one  of  the  variates  of  the  bivariate  norjnal  population 
on  the  basis  of  the  other  variate.  The  "goodness"  of  this 
prediction  will  be  measured  in  terms  of  three  different 
criteria — the  population  conditional  variance,  the  algebraic 
value  of  the  population  correlation  coefficient,  and  the 
^d^solute  value  of  the  population  correlation  coefficient. 

All  three  of  the  problems  considered  in  this  part  are 
rather  artificial  and  are  mainly  of  theoretical  interest.  Our 
principal  reason  for  studying  them  is  that  they  provide  some 
insight  into  similar,  but  more  difficult  problems  (which  do 
have  practical  importance)  considered  in  Part  II. 

In  Part  II  we  consider  the  problem  of  selecting  the 
"best"  set  of  a  preassigned  number  t  of  variates  from  a  set  of 
k  (t  <  k)  variates  (which  we  term  the  predictor  variates)  for 
predicting  a  designated  variate  (which  we  term  the  predictand) 
in  a  (k+1) -variate  normal  population.  Throughout  this  part 
the  "best"  set  of  predictor  variates  will  be  defined  to  be 
that  set  of  t  predictor  variates  for  which  the  predictand  has 
the  smallest  population  conditional  variance  or  equivalently 
that  set  of  t  predictor  variates  with  which  the  predictand  has 
the  largest  population  multiple  correlation  coefficient. 

In  Chapter  3  we  give  the  formal  problem  statement  in  its 
most  general  form  along  with  the  notation  and  an  expression 
for  the  asymptotic  probability  of  correct  selection  for  this 
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problem. 

Throughout  Chapter  4  we  asuume  that  the  predictor  vari¬ 
ates  are  uncorrelated  and  seek  the  asymptotic  sample  size 
which  satisfies  the  probcibility  requirement.  For  (k  =  2, 
t  «  1)  we  accomplish  this  objective  by  finding  the  infimum  of 
the  asymptotic  probability  of  correct  selection.  For  k  >  2  we 
obtain  approximations  to  the  asinnptotic  sample  size  by  finding 
lower  bounds  on  the  asymptotic  probcibilities  of  correct  selec¬ 
tion  and  then  obtaining  lower  limits  for  these  bounds  over  the 
region  of  preference  for  a  correct  selection.  For  (k  >  2, 
t  »  1)  Me  use  the  Slepian  inequality  (Appendix  B)  to  obtain 
this  lower  bound  on  the  asymptotic  probability  of  correct 
selection  and  for  (k  >  2,  t  >  1)  we  use  the  Bonferroni  in¬ 
equality  (Appendix  B)  to  obtain  this  lower  bound. 

In  Chapter  5  we  drop  the  assumption  of  uncorrelated 
predictor  variates  and  proceed  in  a  manner  similar  to  that 
described  in  Chapter  4,  restricting  consideration  to  the  case 
t  «  1.  For  (k  =  2,  t  =  1)  we  obtain  the  exact  asymptotic 
sample  size  which  satisfies  the  probability  requirement.  For 
(k  >  2,  t  *  1)  we  use  the  Bonferroni  inequality  to  obtain  a 
lower  bound  on  the  probability  of  correct  selection  and  then 
find  the  infimum  of  this  lower  bound  over  the  region  of  pref¬ 
erence  for  a  correct  selection  and  the  corresponding  approxi¬ 
mation  to  the  asymptotic  sample  size. 
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In  Chapter  6,  we  suggest  some  areas  of  future  research. 

The  asymptotic  distribution  theory  of  the  sample  statis¬ 
tics  used  in  the  decision  procedures  of  Part  II  is  given  in 
Appendix  A.  The  Bonferroni  and  the  Slepian  inequalities  are 
given  in  Appendix  B.  In  Appendix  C  two  sample  size  approxima 
tions  derived  from  these  inequalities  for  the  problem  of  rank 
ing  the  means  of  normal  populations  with  common  known  vari¬ 
ances  are  compared  with  the  exact  sample  size. 


PART  I 


k  Bivariate  Normal  Populations 

In  Par*;  I  we  will  consider  situations  in  which  we  have  k 
independent  random  vectors^  — i  "  »  ^i2^  '  each  of  which  has 

a  bivariate  normal  distribution  with  unknown  mean  vector 
U-  *  (u-w  M-t)  and  unknown  covariance  matrix  E.,  where  the 
elements  of  I-  are  given  by  o-  __  {r,s  =  1,2).  Throughout 
this  part  we  will  be  interested  in  attempting  to  predict  one 
of  the  variates  of  a  bivariate  normal  population  on  the  basis 
of  the  other  variate.  The  "goodness"  of  the  prediction  will 
be  measured  in  terms  of  the  population  conditional  variance, 
the  algebraic  value  of  the  population  correlation  coefficient, 
and  the  edjsolute  value  of  the  population  correlation 
coefficient. 

Our  objective  will  be  to  provide  a  rational  basis  for  de¬ 
termining  the  seunple  size  for  an  experiment  in  which  the  goal 
is  to  select  the  "best"  bivariate  population  when  certain 
probability  requirements  (which  will  be  described  in  later 
sections)  are  to  be  guaranteed.  The  formulations  that  we  will 
adopt  fall  within  the  frsunework  of  the  so-called  "indifference 
zone"  ranking  approach  as  proposed  by  Bechhofer  [5]. 

^For  simplicity,  we  w:.ll  not  distinguish  notationally 
between  random  variables  and  their  observed  values. 
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2.  Three  formulations 


We  consider  the  three  formulations  of  Part  I  In  this 
chapter.  For  the  first  formulation  we  use  the  exact  joint 
distribution  of  the  statistics  on  which  the  decision  pror 
Is  based.  In  order  to  determine  the  minimum  seunple  si:’'' 
will  guarantee  the  probeiblllty  requirement. 

For  the  latter  two  formulations  we  find  the  joint  anymp- 
totlc  distribution  of  a  transformation  of  the  statistics  used 
In  the  decision  procedures,  and  work  with  this  asymptotic 
distribution  In  the  saune  manner  as  one  would  with  the  exact 
distribution.  That  is,  we  obtain  the  infimum  of  the  asymp¬ 
totic  probability  of  correct  selection  over  the  region  of 
preference  for  a  correct  selection  and  then  choose  our  sample 
size  in  such  a  way  that  this  Infimum  satisfies  the  probability 
requirement. 

2.1  Conditional  variance  formulation 

In  Section  2.1  we  will  be  concerned  with  the  problem  of 
selecting  the  "best"  population  frcxn  a  set  of  k  Independent 
bivariate  normal  populations  n^(i  »  1,  2,  ....  k) ,  the  "good¬ 
ness”  of  the  being  measured  in  terms  of  the  population 
conditional  vaz lances 

<2.1)  Var(x^^|x^2^  “ 
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We  denote  the  ranked  values  of  the 


®[11;1|2  -  °t2];l|2  -  -  ‘’[k];ll2‘ 


It  is  assumed  that  the  true  pairing  of  the  11^  with  the 
^[j)*l|2  unknown  to  the  experimenter,  and  that  he  has  no 
a  priori  knowledge  which  is  relevant  to  the  true  pairing  of 
any  of  the  popuJations  with  the  ranked  values  of  the 


2.1.1  The  goal  and  the  probability  requirement 

In  this  section  we  consider  the  following  goal: 

(2.2)  "To  select  the  associated  with  ®[i].ij2'" 

The  terra  correct  selection  (CS)  will  then  denote  the  action  of 
selecting  the  population  associated  with  more 

than  one  11^  is  such  that  the  associated  cj^.2^|2  equal  to 
o|ij.i|2»  then  the  selection  of  any  one  of  these  is  termed 
a  CS.) 

Before  experimentation  begins  the  experii..  nter  must  spec¬ 
ify  two  constants  {0*,  P*}  with  1  <  0*  <  «<  and  1/k  <  P*  <  1, 
which  are  then  incorporated  into  the  probability  requirement. 
The  numerical  values  of  these  constants  are  assumed  to  depend 
on  the  economic  considerations  of  the  particular  problem.  The 
probability  requirement  can  then  be  stated  as 

PCS  >  P*  whenever  >  e* 


(2.3) 
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The  decision  procedure  which  we  propose  in  the  next  sec¬ 
tion  guarantees  the  probability  requirement  when  the  sample 
sizes  are  chosen  appropriately. 


2.1,2  Single-stage  decision  procedure 

Ne  will  base  our  decision  procedure  on  che  values  of  a 
predetermined  number  N  of  independent  obsc  <  atio'.a 


(2.4)  -  (x|P^ 


,j>=_  2,...  .'i') 


on  each  n^(i  ■  1,  2,  . . .  ,  k)  ,  from  which  we  wi.l  '.aLculate  the 
values  of  the  K  sample  conditional  variances 

N 


(2.5) 


X  .  £  x'P>/N 

P«1 


(j-lr2)  , 


(2.6) 


'^i;jm  "  ~  *iV  ^^im^  "  ^im^ 


{j,m-l,2) , 


and 


(2.7)  *i;l|2  "  ^^i;ll  ■  '’'i;12/’^i>22^/^”"^^  • 


We  denote  the  ranked  values  of  the  by 


■llj;l|2  ^  ®(2Jil|2 


<  ...  <  s 


[IcJ  >1|2 


In  addition  we  let  ^|2  denote  tl.a  sample  co  itiortl  var¬ 

iance  associated  with 

* 

For  this  single-stage  procedure ,  the  experimenter 
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proe««d«  as  follows t  Ha  takas  N  pairs  of  observations  from 
each  of  the  k  computes  the  values  of  the  ®®" 

lects  the  population  according  to  the  following  djeclslon  rule: 

i 

♦ 

"Select  the  population  associated  with 
(2.8)  and  assert  that  this  Is  the  population  associated 

2.1.3  Probability  of  correct  selection 

The  probability  of  correct  selection  for  this  case  can  be 
written  as 


(2.3)  PCS  -  P{8ji).l|2  ^  ®(l);l|2 

By  a  special  case  of  Anderson's  (2]  Theorem  4.3.3,  we 

have  that  each  (N-2)s?.,  ,1^/0?,,  Is  distributed  as  chi- 

(i);l|2'  [ij  ;1  |2 

square  with  N-2  degrees  of  freedom  and  since  these  k  chi- 
square  variates  are  Independent,  this  problem  reduces  to  the 
problem  of  ranking  variances  of  normal  populations  already 
treated  by  Bechhofer  and  Sobel  (10).  They  give  exact  analyt¬ 
ical  expressions  for  the  PCS  as  well  as  tables  for  computing 

the  exact  sample  size  when  the  P*  and  k  values  are  such  that 

» 

this  sample  size  Is  small  (N-2  jc  20)  .  They  also  show  that 
when  the  values  of  P*  and  k  are  such  that  the  sample  size  is 
moderately  large,  a  close  approximation  to  the  sample  size  can 


be  calculated  from 


t 
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(2.10)  .  /n  -  2d(k,P*)/log  0*, 

where  N  n4-2  And  d(k,P*)  Is  given  in  Table  1  of  Bechhofer 

(5J. 


2 . 2  Correlation  coefficient  formulation 

In  Section  2.2  we  will  again  consider  the  problem  of 
selecting  the  "best"  population  from  a  set  of  k  Independent 
bivariate  normal  populations  n^(l  1,  2,  k) .  However  in 

this  section  the  "goodness”  of  the  11^  will  be  measured  In 
terms  of  the  population  correlation  coefficients  where r 
using  tlie  notation  of  Section  2.1, 

(2.11)  j^2/’^®i;li  °i;22* 

We  denote  the  ranked  values  of  the  by 

-1  <  P[i]  <  P[2J  i  •••  i  PjkJ  - 

As  before,  we  assume  that  the  experimenter  has  no  a  pri¬ 
ori  knowledge  which  Is  relevant  to  the  true  pairing  of  any  of 
the  with  the  ranked  values  of  the  parameters. 

2.2.1  The  goal  and  the  probability  requirement 

For  this  measure, of  "goodness"  of  the  IT^,  we  consider  bhe 


following  goalt 
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(2.12) 


'To  select  the  associated  with 


The  term  correct  sele»_cion  will  then  denote  the  action  of  se¬ 
lecting  the  population  associated  with  (i^  more  than 

one  is  such  that  the  associated  is  equal  to  then 

the  selection  of  any  one  oJ  these  11^  is  termed  a  CS.) 

Before  experimentation  begins  the  experimenter  must  spec¬ 
ify  two  constants  {6*,  P*}  with  0  <  6*  <  2  and  1/k  <  P*  <  1, 
which  are  then  incorporated  into  the  probability  requirement. 
For  this  goal,  the  probability  requirement  can  then  be  stated 


(2.13)  PCS  ^  P*  whenever  ^  *^()c-l]  ’ 


The  decision  procedure  which  we  propose  in  the  next  sec¬ 
tion  guarantees  the  probability  requirement  when  the  seunple 
sizes  are  chosen  appropriately. 


2.2.2  Single-stage  decision  procedure 

Our  single-stage  decision  procedure  is  based  on  the 
values  of  the  k  sample  correlation  coefficients  r^  where, 
using  the  notation  of  Section  2.1.2, 


(2.14)  Vj.22. 


We  denote  the  ranked  values  of  the  r^  by 
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^[1]  ^  ^[2]  ^  ^  ^[k] ' 

and  in  addition  we  let  denote  the  sample  correlation  co¬ 

efficient  associated  with 

The  experimenter  proceeds  in  the  aeune  manner  as  in  Sec¬ 
tion  2.1.2,  using  the  following  decision  rule  in  place  of 
(2.8)  : 


"Select  the  population  associated  with  and 

(2.15)  assert  that  this  is  the  population  associated  with 

P(k]-" 


Eaton  (15]  has  shown  that  this  decision  rulif  is  minimax 
and  also  is  most  economical  (Hall  [31],  [32])  within  the  class 
of  dll  decision  rules. 


2.2.3  Probability  of  correct  selection 

The  PCS  for  this  problem  can  be  written  as 

(2.16)  PCS  =  >  r^j  (i  =  1,  2,  ...,  k-1)}. 

Because  of  the  unwieldy  form  of  the  exact  distribution  of 
the  when  pj^  0,  we  will  attack  this  problem  using  the 
asymptotic  distribution  of  Fisher's  variance  stabilizing 
transformation 

2.  =  (1/2)  log((l  +  -  r^)) . 


(2.17) 
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The  approach  of  the  distribution  of  the  to  normality  is 
much  more  rapid  than  that  of  the  distribution  of  the  r^.  The 
are  asymptotically  unbiased  estimators  of  the 

(2.18)  =  (1/2)  log((l  +  p^)/(l  -  p.)), 

and  have  asymptotic  variances  1/n,  where  n  =  N  -  3.  (The  -3 
is  a  small-sample  correction.) 

We  denote  the  ranked  values  of  the  by 

^11]  -  ^[21  -  -  ^(k]* 

Since  is  a  monotonic  increasing  function  of  p^,  the  ranked 
parameters  ^(ij  associated  with  the  same 

population. 

In  a  similar  manner  we  denote  the  ranked  values  of  the 
by 

^[1]  ^12]  ^  ^  ^Ik]  • 

We  also  let  and  denote  the  estimators  of  P[i] 

and  respectively,  i.e.,  ^[i]»  ^(i)  ^(i) 

all  associated  with  the  same  population. 

Hence,  by  letting 

(2.19)  =  (^/2)((z^^J  -  Zjj^j)  -  (5j^j  -  Cjj^j)), 
the  PCS  can  be  written  as 
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(2.20)  PCS  =  P{y.  <  (/n/2)(Cf,^j  -  (i=l,2, . . .  ,k-l)  )  . 

Since  asymptotically  (N  »)  the  variates 
y2'  •••'  ^k-l  ®  multivariate  normal  distribution,  the 

asymptotic  probability  of  correct  selection  PCS  can  be  given 
as 

( 2 . 21)  PCS^  *  ^k~l^^l*  "^2^  •••»  T  > 
where 

^i  “  <'^n/2)  -  C(ij)  f 

and  is  a  (k-1)  -variate  standard  normal  distribution  func¬ 

tion  with  zero  means,  unit  variances  and  off-diagonal  covari¬ 
ances  of  1/2. 

The  parameter  configuration  in  the  region  of  preference 
for  a  correct  selection,  for  which  the  PCS  is  minimized  is 
called  the  least  favorable  configuration  (LFC) .  Since  we  will 
be  working  with  the  asymptotic  distribution,  we  will  denote 
the  parcimeter  configuration  where  the  PCS^  is  minimized  by 
LFC,. 

The  following  lemma  will  be  used  later  in  proving  theo¬ 
rems  concerning  the  LFC^.  Fixing  Pjj^j 


=  p ' ,  we  have : 
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Lemma  2 . 1 


(2.22) 


infimum 


PCS^  « 


‘’(k-ii  <  p'  -  «* 


where 


T'  -  (/n/2) (?•  -  6')  , 

C*  =  (1/2)  log((l+p')/(l-p')) , 


6'  -  (1/1)  log((l+p‘-6*)/(l-p'+6*)) 


This  infimum  is  attained  when 


(2.23)  pjij  »  p'-6' 


(i=l,2,...,k-l) . 


Proof: 

We  use  Rizvi's  [45]  Theorem  1  and  the  monotone  likelihood 
ratio  property  of  the  normal  density  to  show  that  PCS^^  is  a 
nonde ::reasiiig  function  of  and  a  nonincreasing  function  of 

5j^j(i  =  1,  2,  ...,  k-1)  for  the  decision  rule  (2.15). 

Since  is  a  monotone  increasing  function  of  P[i]»  the 

infimum  of  the  PCS.  is  attained  at  ^r.,  =  -  6'(i  = 

a  1 1 J 

2,  k-1)  and  we  have  the  desired  conclusion. 


Using  this  lemma  ve  now  find  tha  LFC^. 
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Theorem  2.1 


infimum  PCS  *  ,  (d,  d,  d)  , 

(2.24)  ® 

^Ik]  -  Plk>l] 

where 

d  »  (✓n/2)log((l  +  6*/2)/(l  -  6*/2) )  . 


The  corresponding  LFC^  is 

P(i,  -  -<!V2 

(i-l,2,...,k-l) 

(2.25) 

P,^,  -  6V2. 

Proof ; 

Using  Lenuna  2.1,  the  problem  reduces  to  finding  the  infi- 
mura  of  t'  when  6*-l  ^  p'  1-  Setting  the  derivative  of  t' 
(taken  with  respect  to  p')  equal  to  zero  we  obtain  p'  ■  S*/2. 

Differentiating  a  2nd  time  with  respect  to  p'  and  evalu¬ 
ating  this  expression  at  p'  «*  6*/2  we  have 

(1  +  6*/2)”‘(l  -  6*/2)”*  -  (1  -  6*/2)'*(l  +  6*/2)"*, 

which  can  be  shown  to  be  >  0  for  all  6*  >  0  by  noting  that 

(2+6*)/(2-6*)  <  ((2+6*)/(2-6*) ) *. 

Hence  p'  »  6*/2  yields  a  minimum,  and  the  LFC^^  is 
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P[il  =  -6V2  (i=l,2,...,k-l) 

(2.26) 

P[k]  =  P’  = 

The  asymptotic  saunple  size  can  be  calculated  by 

(2.27)  /n  -  2d(k,P*)/log{ (1  +  6*/2)/(l  -  6*/2)}, 
where  d(k,P*)  is  given  in  Table  1  of  [5). 

2.2.4  An  additional  restriction 

In  some  cases  the  experimenter  may  be  interested  in  guar¬ 
anteeing  the  probability  requirement  only  when  pjj^j  >  p* ,  a 
preassigned  constant,  or  he  may  have  information  that 
P[)t]  -  P**  Formally: 

(2.28)  PCS  ^  P*  whenever  pjj^j  >  +  5*  and  Pjj^j  ^  p* . 

For  this  new  probability  requirement,  it  is  obvious  that 
when  -1  ^  p*  ^  6''/2,  the  result  of  Theorem  3.1  still  holds, 
however,  when  >  6*/2,  we  obtain  a  new  LFC^  and  a  reduction 

a 

in  the  asymptotic  sample  size. 

Theorem  2.2 


infimum  “  ^k-1^^'  ***'  ' 

^Ik]  -  ^[k-1]  * 

P(k]  >  P*  >  5V2 


(2.29) 


where 


d  =  (/n/2)log{  (1+p*)  (l-p*+6*)/(l-p*)  }. 

The  corresponding  LFC^  is 

Q 

P[i]  -  p*  -  6*  (1=1,2,. 

(2.30) 

"(kj  =  P* 

Proof : 

We  again  use  Lemma  2.1  which  reduces  the  problem  to  find¬ 
ing  the  value  of  p'  where  the  Inflmum  PCS  is  attained.  Just 

P*  <  P’ 

as  before  we  note  that  p'  appears  only  as  an  argument  of 
and  since  is  a  monotonic  Increasing  function  of  each  of 

Its  arguments,  we  need  only  minimize  t'.  Next  we  show  that 
the  derivative  of  t'  with  respect  to  p*  Is  nonnegative,  l.e., 

3tV3p'  -  {{1+p')  d-p')  }■*  -  {(l+p‘)  d-p’)  + 

p’6*  - 

>  Or 


when 


6*/2  <  P'  <  1. 

We  use  this  result  along  with  the  result  of  Theorem  2.1  (unre¬ 
stricted  minimum  occurs  at  6*/2)  and  the  continuity  of  the 
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function  to  complete  the  proof  of  the  theorem. 

Again  using  Table  1  of  [5]»  the  asymptotic  seunple  size 
can  be  calculated  from 

(2.31)  /n  =  2d(k,P*)/log{(l+p*) (l-p*+6*)/{l-p*) (l+p*-6*) }. 

2.3  Absolute  value  of  the  correlation  coefficient  formulation 
In  Section  2.3  we  consider  the  cUssolute  value  of  the  pop¬ 
ulation  correlation  coefficient  =  I  I  as  the  measure  of 
"goodness"  of  the  and  are  interested  in  selecting  the 

with  the  largest  from  a  set  of  k  independent  II^.  We  denote 

the  ranked  values  of  the  by 

0  <  c,ij  <  C(2,  i  ...  i  <  1. 

Again,  we  assume  that  the  experimenter  has  no  a  priori 
knowledge  which  is  relevant  to  the  true  pairing  of  any  of  the 
with  the  ranked  values  of  the  parameters . 

2.3.1  The  goal  and  the  probability  requirement 
The  corresponding  goal  is: 

(2.32)  "To  select  the  associated  with 

The  terra  correct  selection  (CS)  will  then  denote  the  action  of 
selecting  the  populat.i.Dn  associated  with  (if  more  than 

one  is  such  that  the  associated  is  equal  to  then 
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the  selection  of  any  one  of  these  is  termed  a  CS.) 

The  experimenter  must  specify  the  saune  two  constants  as 
in  Section  2.2.1  (in  this  case  0  <  6*  <  1)  .  The  probability 
requirement  can  then  be  stated  as 

(2.33)  PCS  ^  P* ,  whenever  ^  ^[k-1] 

The  decision  procedure  which  we  propose  in-  the  next  sec¬ 
tion  guarantees  the  probability  requirement  when  the  sample 
sizes  are  chosen  appropriately. 

2.3.2  Single-stage  decision  procedure 

Our  single-stage  decision  procedure  is  based  on  the  abso¬ 
lute  values  of  the  k  seunple  correlation  coefficients  t^  » 

I  r^  I  ,  where  the  r^^  are  given  by  (2.14).  We  denote  the  ranked 
values  of  the  t^,  by 

t(l]  <  t(2]  i  ^(k)  • 

In  addition  we  let  denote  the  sample  quantity  associated 

with 

The  experimenter  proceeds  in  the  same  manner  as  in  Sec¬ 
tion  2.1.2  using  the  following  decision  rule  in  place  of  (2.8) 

"Select  the.  population  associated  with  and 

(2.34)  assert  that  this  is  the  population  associated  with 


If 
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2.3.3  Prob£d?ility  of  correct  selection 

The  PCS  for  this  problem  can  be  written  as 

(2.35)  -  P{t^j^j  >  t^j  (i  =  1,  2,  . ,  k-1)  }. 

Because  of  the  unwieldy  form  of  the  exact  distribution  of 
the  tj^  when  0,  we  will  attac)c  this  problem  using  the  fact 

that  the  asymptotic  di itribution  of  (2.17)  is  normal. 

We  define 


(2.36)  w^  =  (l/2)log((l+t^)/(l-t.)) , 

(2.37)  *  (l/2)log((l+i;i)/(l-Ci))  , 

and  denote  the  raniced  values  of  the  w^  and  tha  by 


''(IJ  ^  ''(2] 


< 


<  w 


IK] 


and 


(1] 


<  f 


IK]  ' 


respectively.  In  addition  we  let  the  denote  the  esti¬ 
mators  of  the  '•'[ijr  and  hence  '*(i)'  *’[i]  '*’’il 

all  associated  with  the  same  population. 

Since  and  are  symmetrical  functions  i-f  r^  and  p^, 
respectively,  ed>out  zero,  it  foa.lov'E  that 


(2.38) 
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and 

(2.39)  =  Kil . 

Using  (2.38),  (2.39),  and  some  results  obtained  by  Rizvi  [45] 
for  ranking  the  absolute  values  of  means  of  normal  popula¬ 
tions,  we  find  the  LFC^  for  our  problem  in  the  following 
theorem. 

Theorem  2 . 3 

The  LFCg  which  satisfies  the  conditions  of  the  probabil¬ 
ity  requirement  (2.33)  is  given  by 

^[i]  “  °  (i«=l,2,...,k-l) 

(2.40) 

^[k]  ‘ 

Proof : 

Since  and  9w^/9tj^  are  positive  when  0  ^  1 

and  0  ^  ^  1,  respectively,  we  have  by  Theorem  1  of  Rizvi 

[45]  that  the  PCS^  is  a  nonincreasing  function  of  and  a 

nondecreasing  function  of.  2,  ...,  k-1)  .  Conse¬ 
quently  for  any  fixed  nonnegative  value  of  =  C'  (say)  , 

(2.35)  is  minimized  subject  to  the  restriction  of  (2..)3)  by 
setting 

^[i]  -  C’  -  6*  (1  =  1.2 . k-1)  . 

(2.41) 
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'^[k]  - 

The  equivalent  of  Rizvi's  Theorem  2  for  ;  follows  from 

^  0.  (Note:  Rizvi's  9^  values  are  our  4*^  values.) 
These  results  along  with  the  LFC  given  by  Rizvi  for  his 
problem  yield  the  LFC^  stated  in  this  theorem. 

The  asymptotic  sample  size  can  be  calculated  from 

(2.42)  /n  »  2A(k,P*)/log{ (l+6*)/(l-6*) ), 

where  the  X  values  are  given  in  Table  II  of  Rizvi  (45). 

2.3.4  An  additional  restriction 

If  we  add  another  restriction  to  (2.33) ,  similar  to  that 
of  Section  2.2.5,  the  new  probability  requirement  is  given  by 

(2.43)  PCS  ^  P*  whenever  ^  *’()t-l)  * 

and  ^  C*. 

Under  this  new  probability  requirement,  it  is  obvious 
that  when  0  ^  ^  6*,  the  result  of  Theorem  3.3  still  holds. 

However,  when  >  6*,  we  obtain  a  new  LFC.  with  a  correspond- 

ing  reduction  in  sample  size. 

Theorem  2.4 

The  LFC-  which  satisfies  the  conditions  of  the 

Cl 
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probability  requirement  (2.43)  is  given  by 

'^[i)  -«*-«*  U.1,2 . k-1) 

(2.44) 

«lkl  *  «*• 

Proof ; 

This  proof  follows  in  the  Scune  manner  as  the  proof  of 
Theorem  2.3. 

The  asymptotic  sample  size  can  be  calculated  from 

(2.45)  /ii  =  2X(k,P*)/log{(l+6*)  (l-c*+6*)/(l+i;*-6*)  (1-6*)  }, 
where  the  values  of  A  are  given  in  Table  II  of  [45]. 


PART  II 


One  (k-H) -Variate  Normal  Population 


In  Pa;  t  II  we  will  consider  situations  in  which  we  have 
a  random  vector  x  »  (Xq,  Xj^)  which  is  a  (k+1)- 

variate  normal  distribution  with  unknown  mean  vector  » 

(Uo#  •••r  Ujj)  and  unknown  covariance  matrix  T.  Throughout 

this  part  we  will  be  interested  in  predicting  the  variate  Xq 
{v.’hich  we  term  the  predictand)  on  the  basis  of  the  best  linear 
combination  of  variates  in  sets  of  fixed  size  t  of  the  k 
(t  <  k)  variates  Xj^,  x^r  ...»  Xj^  (which  we  term  the  predictor 
variates).  For  any  given  set  of  t  predictor  variates,  the 
"goodness"  of  the  prediction  will  be  measured  in  terms  of  the 
population  conditional  variouices  of  Xq  given  these  t  predictor 
variates,  tJg  i  i  »i  equivalently  in  terms  of  the 

population  multiple  correlation  coefficient  between  Xq  and 
these  t  predictor  variates,  Rn  ^  4  i  )•  f’or  fixed  (k,t) 

we  will  be  interested  in  these  U  =  parameters. 

Ultimately  our  objective  will  be  to  provide  a  rational 
basis  for  determining  the  sample  size  for  an  experiment  in 
which  the  goal  is  to  select  the  "best"  set  of  t  variates  when 
certain  probability  requirements  (which  will  be  described 
precisely  in  Section  3.3)  are  to  be  guaranteed. 
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The  general  problem  of  selecting  a  set  of  variates  to 
predict  a  specified  variate  is  an  old  one.  The  formulation 
that  we  adopt  here  falls  within  the  framework  of  the  "indif¬ 
ference  zone"  ranking  approach.  In  this  formulation  it  is 
necessary  to  know  the  exact  joint  distribution  of  the  statis¬ 
tics  on  which  the  decision  procedure  is  based »  in  order  to 
determine  the  minimum  sample  size  which  guarantees  the  prob- 
^d^ility  requirement.  For  all  of  the  problems  which  we 
consider  in  Part  II,  the  determination  of  these  exact  distri¬ 
butions  appears  almost  hopeless  (and  even  if  one  were  able  to 
find  them,  they  would  be  very  unwieldy)  .  Thus  we  will  attack 
these  problems  by  finding  the  asymptotic  joint  distribution  of 
these  statistics  and  work  with  it  in  the  seune  manner  as  one 
would  with  the  exact  joint  distribution,  were  it  known.  That 
is,  we  will  obtain  the  infimum  of  the  asymptotic  probability 
of  correct  selection  over  the  region  of  preference  for  a  cor¬ 
rect  selection  and  choose  our  sample  size  in  such  a  way  that 
this  infimum  satisfies  the  probability  requirement.  The 
asymptotic  joint  distributi'^  is  referred  to  above  are  derived 
in  Appendix  A.  We  will  study  the  various  special  cases  in  the 
different  chapters  of  Part  II.  For  certain  of  these  cases  the 
exact  infimum  of  the  asyinptotic  probability,  referred  to 
above,  is  also  difficult  to  obtain,  and  in  these  cases  we  will 


find  a  lower  bound  to  this  asymptotic  probability  and  obtain 


33 


the  infiraum  of  this  lower  bound.  Using  these  results,  we  will 
find  8(xne  conservative  approximations  to  the  asymptotic  sample 
sizes  necessary  to  satisfy  the  probed>ility  requirements. 


3.  General  notation 


In  Part  II  we  will  consider  sets  of  a  preassigned  number 
of  t  of  the  k  predictor  variates  and  will  use  the  following 
notation  to  label  these  U  *  different  sets.  Let  a  = 

02 »  ...»  otj^)  be  a  k-vector  consisting  of  zeros  and  ones 
k 

with  Z  Oj^  =  t,  a  given  integer  d  ^  t  ^  k-1)  ;  and  let  3^  = 

(x.  ,  X.  ,  ...»  X.  )  be  a  t-vector  obtained  from  the  k-vector 
^1  ^2  ^t 

X2»  ...»  Xj^)  of  the  predictor  variates  by  deleting  those 
x^  for  which  o^  *  0. 

In  a  similar  manner  we  let  and  denote  the  popula¬ 
tion  mean  vector  and  the  population  covariance  matrix,  respec¬ 
tively,  of  ^  and  let  Oq^  denote  the  vector  of  population 
covariances  between  the  random  variaUt^le  Xq  and  the  random 
vector  x^. 

In  addition,  we  denote  the  population  conditional  vari¬ 
ance  of  Xq,  given  the  set  of  predictor  variates  by 

(3.1)  Var(X(,|x^)  - 

and  the  population  multiple  correlation  coefficient,  between 
Xq  and  x^,  by 

*^0.a  '  ^  ' 

so  that  the  conditional  variance  of  Xq,  given  the  set  of 
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predictor  variates  x  ,  can  be  written  as 


'O.a 


°00  "  -0a^a*-6a 


(3.3) 


«0.a) 


3 . 1  Ranked  values  of  the  parameters — notation 

In  our  formulation  of  the  ranking  problem  we  will  be  in¬ 
terested  in  the  ranked  values  of  the  ai  ^  and  of  tne  Rn 

u .  a  u .  a 

For  fixed  t  we  denote  the  ranked  values  of  these  parameters  by 
°0.  (1]  -  ®0.  (2J  -  °0.  [U1 

and 

^0.  (1]  i  ^0.  [2]  ^  [U]  ' 

K 

where,  as  before,  U  =  C^.  Since  Rq  ^  ^  0 ,  we  have 
(3.4)  ^0.[i]  “  lU-i+1]^ • 

It  is  assumed  that  the  true  pairing  of  the  x  with  (or 

equivalently  with  Rq  jy_j^.j^j)  is  unknown  to  the  experimenter 
and  that  he  has  no  a  priori  knowledge  which  is  relevant  to  the 
true  pairing  of  any  of  the  populations  with  the  ranked  values 
of  the  pareuneters,  i.e.,  it  is  not  known  which  t  of  the  k 
variates  Xj^,  x^,  ...»  Xj^  are  associated  with  any  of  the 

""o.  (jj- 
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3.2  Goals 

In  Part  II  the  following  two  equivalent  gc^als  will  be  of 


interest  to  us: 


(3.5a) 


Goal  A:  "To  select  the  set  of  t  variates  associated 


(3.5b) 


Goal  B:  "To  select  the  set  of  t  variates  associated 

Ro.tu)-' 


The  term  correct  selection  will  denote  the  action  of 
selecting  the  set  of  t  variates  associated  with  Oq  (or 

equivalently  the  set  of  t  variates  associated  with 
(If  more  than  one  of  the  Oq  ^  are  equal  to  [1]  '  the  selec¬ 
tion  of  the  set  of  variates  corresponding  to  any  of  these  Oq  ^ 
is  considered  a  correct  selection.) 


3 . 3  ProbeUaility  requirement 

Before  experimentation  begins  the  experimenter  must  spec¬ 
ify  two  constants  {0*,P*}  with  1  <  0*  <  ®  and  1/U  <  P*  <  1, 
which  a-v'  then  incorporated  into  the  probed^ility  requirement. 
The  numerical  values  of  these  constants  are  assumed  to  depend 
on  economic  considerations  associated  with  the  particular 
problem.  The  probability  requirement  can  be  stated  as 


(3.6a) 


PCS  ^  P*  whenever  Oq  [2]'^°0  (1]  - 
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or  equivalently, 

(3.6b}  PCS  >  P*  whenever  (U-l]  ^ (Ul  ^  -  ®** 

The  decision  procedure  which  we  propose  in  the  next  sec¬ 
tion  guarantees  these  probability  requirements  when  the  sample 
sizes  are  chosen  appropriately. 


3 . 4  Ranked  values  of  the  stati3tic3--notation 

In  general  we  assume  that  the  experimenter  will  be  taking 
a  predetermined  number  N  of  independent  vector-observations 


(3.7) 


{p=l, 2 , . . . ,N) 


from  a  (k+1) -variate  normal  population.  In  v'^erms  of  the  N 
vector-obaervatio.ns ,  we  denote  the  sample  mean  of  the  i~ 
variate  by 

N  ^  j 

(3.8)  X,  -  Z  x'^^Vn, 

^  p-1  ^ 

and  the  vector  of  sample  means  and  the  matrix  of  sums  of 
squares  and  cross  products  of  deviations  about  the  mean  by 

(3.9)  X  -  (Xq,  x^,  ...,  ^); 
and 

* 

(3.10)  V  -  E  (x^P^  -  x)'(x^*^^  -  X), 

p-1 
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respectively. 

Using  the  a  notation  defined  in  Section  3,  the  sample 
mean  vector  and  the  sum  of  squares  and  cross  products  of  devi¬ 
ations  about  the  mean  matrix  of  x  are  given  by 

—a 


(3.11)  ^  i  • 


and 


(3.12) 


f  -  Z  (x  -  X  )  •  (X  -  X  )  , 
a  p»i  -a 


respectively.  The  usual  sample  quantities  associated  with 

"oo'  2oa'  "o-ci'  'o-d 


(3.13) 


N 


''00  ■  pfl'^O 


(p)  . 


Xfl)’, 


(3.14) 


*  pL'^O*”  ■ 


(3.15) 
and 

(3.16) 


“O.a  ■  <''00  -  i(0a''d‘!l0a'/<''-‘-l'- 


"O.o  ■  <<><Od''a‘!!Od»/''00>'^’- 


We  let  n  ■  N  -  t  -  1  and  note  that 


»5.a  ■  ''OO'I  -  "i.a’/*'- 


We  denote  the  ran)ced  values  of  the  si  and  the  by 

0 .  a  0 .  a 


2  2  ^  <a^ 

°0.(U] 


■o.[l]  0.[2] 
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and 


*^0.  tU  ‘^0.(21  ^  ^0.  [U]  ' 

respectively.  Since  R.  >  0,  we  have 

0 .  a  ■■ 

(3.18)  sj  «  Vqq(1  -  Rq.  ,  J-i+11^/"* 

In  addition  we  denote  the  sample  conditional  variance  and  the 
sample  multiple  correlation  coeffici^it  of  the  (t+1) -variate 
normal  distribution  associated  with  o*  and 
Sq  (i)  and  Rq.  (U-i+1)  i^espectively ,  so  that 

(3.19)  -  Vqq(1  -  Rq.  (u-i+l)^/*'’ 

3 . 5  Single-stage  decision  procedure 

In  Part  II  we  will  be  concerned  with  a  single-stage  pro¬ 
cedure  which  guarantees  the  probability  requirement  (3.8). 

For  this  single-stags  procedure  the  experimenter  proceeds  as 
follows:  He  talces  N  independent  vector-observations  from  the 
(k+1) -variate  normal  population,  computes  the  values  of  the  U 
sample  conditional  variances  Sq  ^  (or  equivalently  the  U 
s^ple  multiple  correlation  coefficients  Rq  ^j)  and  selects  the 
set  of  t  variates  according  to  the  following  decision  rule: 

"Select  the  pet  of  t  variates  associated  with  s^ 

(or  equivalently  i-,,)  and  assert  that  this  set  of 

(3.20) 
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variates  Is  associated  with  Oq  (or  equivalently 
^0. (U1 ^ 

3.6  Probedjility  of  correct  selection 

The  probability  of  correct  selection  (PCS)  using  the  pro¬ 
cedure  of  Section  3.5  can  be  written  as 

( 3 . 21)  PCS  *  P{Sq  (i)*'  ~  •••»  U)}» 

where  each  cf  the  n  Sq  (i)/°o  [i]  distributed  as  chi- 
square  with  n  degrees  of  freedom  (Anderson  [2] ,  Theorem 
4.3.3).  However  the  Sq  are  not  independently  dis¬ 
tributed  and  their  joint  distribution  does  not  appear  to  be 
known.  The  exact  distribution  would  appear  to  be  at  least 
as  complicated  as  that  of  the  joint  distribution  of  the 
simple  correlation  coefficients  in  seunples  from  a  multivariote 
normal  distribution,  which  is  quite  messy.  Since  knowledge 
of  this  distribution  is  necessary  in  order  to  determine  the 
minimum  required  sample  size  (see  the  analogous,  but  much 
simpler  problem  described  in  Section  2.1.4),  we  will  study 
this  problem,  from  the  large-sample  point  of  view.  (In  prac¬ 
tice,  "large"  samples  will  usually  be  required  when  applying 
this  procedure.)  By  using  the  variance  stabilizing,  logarith¬ 
mic  transformation  of  the  sample  conditional  variances,  we 

* 

obtain  an  asymptotic  approximation  for  the  PCS  which  should  be 
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sufficiently  accurate  to  determine  the  sample  size  requirement 
for  many  problems. 

We  define 

(3.22)  y|  =  (i=l ,  2  , .  .  .  ,U-1 )  , 

(3.23)  /n/ z )  log  (Oq  [ij/^0  [U“i+lJ^  {i=l>2/...»U—l), 
and 


“  y[  +  (i»l,2, . . . ,U-1) . 

Then  (3.21)  can  be  written  as 

(3.24)  PCS  »  P{w’  <  yJ  (1  »  Ir  2,  ...,  U  -  1)}. 

Using  Theorems  A.l  and  A, 2,  we  obtain  the  following 
asymptotic  (N  -►  «)  appro'timation  for  this  PCS,  which  we  denote 
by: 

(3.25)  PCS^  -  y'2’  •••' 

where  is  a  (U-1) -variate  normal  distribution  function 

having  zero  means.  (The  ♦  here  is  used  to  indicate  that  the 

variances  are  not  unity,  i.e.,  this  is  not  a  standardized 

*  • 

multivariate  normal  distribution  function.)  We  have  net  been 


able  to  determine  the  covariance  matrix  for  general  E  and 
arbitrary  t.  However,' in  Appendix  A  we  give  results  for 
general  I  when  t  »  1  and  for  a  special  form  of  I  (i.e.. 
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uncorrelated  predictor  variates)  for  arbitrary  t.  These 
results  will  be  used  in  the  following  chapters  to  determine 
the  asymptotic  S2unple  size. 


4 .  Uncorrelated  predictor  variates 

Throughout  Chapter  4  we  consider  the  case  in  which  the 
predictor  variatc-s  (k+1) -variate  normal 

distribution  are  uncorrelated,  i.e.,  =0,  (i  /  j;  i  = 

1,  2,  ...»  k) .  We  denote  the  covariance  matrix  of  this  (k+1) 
variate  normal  distribution  by  where  the  ij—  element  of 
Iq  is  given  by 


r 


\ 


(4.1) 


11 


(j  =  i;  i 
j; 


'v  ! 

The  mean  vector  u  and  the  nonzero  elements  of  are  assumed 

to  be  unknown.  The  assumption  of  uncorrelated  predictor  vari 
ates  yields  a  simplification  in  the  covariances  of  the  asymp¬ 
totic  joint  distribution  of  the  s^  ^  and  allows  the  require¬ 
ment,  that  Lq  be  nonnegative  definite,  to  be  expressed  in  a 
simple  form. 


4 . 1  t  =  1 

We  first  consider  the  situation  in  which  the  experi¬ 
menter's  objective  ia  to  select  t  -  1  variate  from  the  k 
possible  predictor  variates. 

4.1.1  Notation 


For  this  case,  thf.  general  notation  of  Chapter  3 
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simplifies  considerably.  The  k-vector  a  now  consists  of  k  •  1 
zeros  and  1  one.  If  we  let  i  denote  that  component  of  a  which 
is  one,  then 


(4.2) 


?-a  "  ^i' 


(4.3) 


°0.a  ”  ^O.i' 


^O.a  '  ^O.i' 


and 


since  ^  -  pQi'  becomes 


(4.5) 


'O.i  ^00 


^ nn  (  ^  "■  • 


We  denote  the  ranked  values  of  the  by 


P0(1)  -  p6(2J  -  • • •  ^  P0[k) ' 


and  write  (3.4)  as 


(4.6)  °0.(i)  "  ^00^^  ■  P0(k-i+ll^ 


Then  the  goal  (3.5)  can  be  stated  as: 


(4.7) 


'To  select  the  variate  associated  with  Po[)^]»" 


and  the  probeibility  requirement  (3.6)  becomes 


(4.8)  PCS  >  P*  whenever  (1  -  "  Po(k]^  - 


Similar  notational  simplifications  result  for  the 
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corresponding  sample  quantities,  l.e.,  (3.12),  (3.14),  (3.15), 
and  (3.16)  become 


(4.9) 

''ii  - 

(4.10) 

''Oi  ‘  -  *i'' 

(4.11) 

•o.i  ■  •''00  - 

and 

(4.12)  R„  ^ 

respectively.  Since  Rq  n  *  N  -  2,  (3.17)  becomes 

(4.13)  sj  ^  -  Vggd  -  rj^)/n. 

Denoting  the  ranked  values  of  the  by 

*^0(1J  *^0[2]  ^  ^  *^0(k)  ' 

(3.18)  becomes 

In  addition  we  denote  the  sample  correlation  coefficient  asso- 

» 

elated  with  Pqjj^j  ^^y  hence  (3.19)  can  be  written  as 

*0.  (i)  “  "  '^0(k-l+l) 


(4.15) 
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4.1.3  K  ^  2 

In  this  section  we  obtain  the  asymptotic  sample  size  by 
finding  the  infimum  of  the  PCS.  over  the  region  of  preference 

a 

for  a  correct  selection  for  the  case  (k  =  2,  t  «  1) .  The  com¬ 
plex  nature  of  the  correlation  (in  the  asymptotic  distribu¬ 
tion)  between  the  estimators  Sq  and  s^  ^2) '  which  depends 
on  the  covariance  matrix  £q  of  the  original  (k+l) -variate  nor¬ 
mal  distribution,  complicates  this  problem.  For  this  case 
(4.20)  reduces  to 

(4.22)  PCS^  -♦(£!), 

vrhere  t  (*4^^)  is  the  standard  univariate  normal  distribution 
function . 

Preliminary  to  finding  the  infimum  of  (4.22),  we  give  two 
lemmas.  Lemma  4.1  is  a  representation  of  the  requirement  that 
Zq  be  nonnegative  definite  (n.n.d.)  when  k  2.  Lemma  4.2 
shows  that  is  a  decreasing  function  of  PQ[j^]* 

Lemma  4.1 

For  k  s  2,  Zq  n.n.d.  is  equivalent  to  the  following 
inequalities : 

>  0  (i=0,l,2) 


oil  +  <>02  i  1 
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Proof ; 

This  equivalence  is  easily  ested^lished  by  using  the 
representation  that  a  symmetric  matrix  is  n.n.d.  if  all  of  its 
principal  minors  are  nonnegative. 

Lemma  4 . 2 

(4.23)  ^  ^  (i“l r 2 » .  . .  »k-l)  , 

when 

PQliJ  ^Olk]  -  ^ 

and 

‘  ”  P0[k]^  - 

Proof : 

We  let 

g  «  (/n/2)log((l  -  Poiij)/<l  *  Po(kJ^^ 
and  h  «  /I  -  so  that  »  g/h.  Then 

3g/^PQ|ij  =  -/n/(2{l  -  Pq^)) 

<  0 

and 


(4.24)  3h/3pQm  ■  f/(2h(i  -  ”  ^0[k]^^' 
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where 


(1  - 


’O(i) 


Po[Jcl^ 


To  show  that  (4.24)  is  positive  we  need  only  show  that  f  >  0. 
But 


af/apJ^ij  =  -2(1 


Poii]^ 


<  0, 


and  hence  f  is  a  decreasing  function  of  Pq^j*  To  show  that 
f  >  0,  we  increase  Pgji]  until  either 


'Oli) 


(1  -  0*)  +  9*p^ 


0[)c] 


or 


Po(i]  "  ^  ■  P0()^J' 

which  ever  occurs  first.  In  either  case  the  result  follows 
immediately.  Combining  these  results  and  noting  that  h  _>  0 
and  g  <  0,  we  obtain  the  desiied  conclusion. 


Theorem  4.1 


infimum  'Ke^^)  *»  '^(e*), 
(4.25)  >  6* 

n.n.d. 
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where 


e*  “  (/n/2)  log  0* 


and 

®12  “  "  Pc:2]^* 

Proof ; 

For  any  pair  of  values  of  Po[i]  ^0[2]  satisfying  the 

restrictions  of  the  theorem,  it  is  obvious  from  Lemma  4.2  that 
4>(ej^)  is  a  decreasing  function  of  Hence  to  obtain  the 

infimum  of  we  increase  Pqjjlj  until  it  attains  values  on 

one  of  the  two  boundaries  (Figure  4.1),  i.e.,  on 

(4.26a)  oJn,  -  (1  -  6*)  *  e«p5,2, 
or 


(4.26b)  =  1  -  p 


0(21* 


Along  either  of  these  boundaries,  can  be  expressed  as  a 
function  of  just  one  of  the  pair  of  parameters  Pqjj^j  ®ud 

^012] • 


Case  (a) 

After  solving  (4,. 26a)  for  Po(2]'  substituting  this 

expression  into  (4.16),  we  obtain  expressions  for  y 


Figure  4.1 

Preference  Region  (R)  for  a  Correct  Selection 


on  boundary  (a)  which  ws  denote  by  and  e 


X  •  6  •  / 


(0*(i  -  p*  )  -  0**p*  )* 

_ o[ii  o[i] 


(1  -  p 


OliJ 


) 


and 


»  (/n  log  0*)/(2/l  -  y^)  . 


Since  log  0*  >  0,  d  *  Yg)  >  0  and 


.<  0, 


,  respectively 


we  have 
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3e^/ap 


0[1] 


{/ri  log  0(1] 

2/r^ 

<  0. 


Hence  is  a  decreasing  function  of  Po[i]  infimum  of 

e^,  along  boundary  (a)  and  subject  to  the  conditions  of  the 
theorem,  occurs  at  the  intersection  of  boundary  (a,  and 
boundary  (b) . 


Case  (b) 

In  a  similar  manner  along  boundary  (b) ,  ai  ter  solving 
(4.26b)  for  Pqj2]  substituting  into  (4.16)  and  (4.18),  we 

obtain  Yb  =  0  and 

~  ('^n/2)  log((l  “  ^0  [  1 )  ^  0  ( 1 1  ^  * 

Since 

^^b/^^Od]  *  "'^”/^^P0(l’;  *  ^0(1]^^ 

<  0, 

Cj^  is  a  decreasing  function  of  Pq^ji’  and  the  infimum  of  , 
along  boundary  (b)  and  subject  to  the  conditions  of  the  theo¬ 
rem,  also  occurs  at  the  intersection  of  boundary  (a)  and 
boundary  (b) . 

Hence  the  infimv^  of  ♦(c^)  over  this  region  is  attained 
at  the  intersection  of  the  two  boundaries.  Solving  {4.26a) 
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and  (4.26b)  for  Po[i]  and  Po[2‘i'  obtain  the  LFC  (Section 
2.2.3) 

Po(i]  “ 

(4.27) 

P0I2]  =  ' 

and  the  conclusion  of  the  theorem  follows. 

Using  the  result  of  this  theorem,  an  asymptotic  approxi¬ 
mation  to  the  sample  size  can  be  obtained  by  setting  the 
r.h.s.  of  (4.25)  equal  to  P*,  from  which  we  obtain 

(4.28)  /n  -  2*“MP*)/log  6*, 

where  is  the  inverse  of  the  standard  normal  distribution 

function  ♦. 

Alternate  Proof; 

The  theorem  can  also  be  proven  indirectly  by  noting  that 
^  whenever  the  restrictions  of  the  theorem  are  satis¬ 
fied  and  hence 

infimum  4>(e^)  ^  ♦(€*). 

(4.29)  >  0* 

Eq  n.n.d. 

Substituting  the  parameter  values  given  by  (4.27)  into  (4.16) 
and  (4.18),  we  see  that  the  equality  is  attained  and  the  proof 
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is  complete. 

The  rather  long  direct  proof  was  given  because  it  pro¬ 
vides  some  insight  into  the  solution  of  more  general  problems 
considered  in  later  sections.  In  addition  some  of  the  lemmas 
associated  with  this  theorem  are  used  in  the  proofs  of  some  of 
the  theorems  concerning  these  problems. 


4.1.4  k  >  2 

For  k  >  2  the  problem  of  finding  the  infimum  of  the  PCS^ 
(4.20)  is  complicated  by  the  fact  that  both  the  and  the 
are  functions  of  the 

To  illustrate  these  complications,  we  briefly  consider 
the  case  k  “  3.  Using  the  results  of  Plackett  [43)  for  the 
reduction  of  multivariate  normal  integrals,  we  have^ 


(4.30)  PCS^  »  *=2^ 


“  ♦(ei)*(e2)  +  »<Poill'  Pot21^' 


where 


(4.31) 


12 


H(pJjij%  Pj{2j^  “ 


71)  j  (1//1  -  l“) 


^Gnanadesikan  (231  has  used  this  method  to  obtain  some 
numerical  results  for  the  "subset"  approach  to  some  selection 
procedure  problems  involving  a  multivariate  normal 
distribution. 
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exp 


-  (e^  ^2  ”  2Xej^e2) 

2(1  -  X2) 


dX. 


The  difficulties  encountered  in  finding  the  infimum  of 
the  PCS^  using  this  expression  become  apparent  by  taking  the 
derivatives  of  (4.30)  with  xespect  to  the  However  we 

do  note  that  by  combining  Lemma  4.5  and  (4.30)  we  obtain  the 
following  inequality: 

(4,32)  ^2^  -  ' 

which  is  a  special  case  of  Theorem  4.3. 

Because  of  the  complexity  of  the  PCSg  expression  for 
k  »  3»  we  circumvent  this  problem  for  k  >  2  by  finding  a  lower 
bound  for  the  PCS^.  Preliminary  to  this  we  prove  three  lemmas 
from  which  this  bound  will  follow  directly  as  Theorem  4.2. 


Lemma  4 . 3 

n.n.d.  Is  equivalent  to  the  following  inequalities: 

^ii  i  ®  (i»0,l, . . . ,k) 


k 

Z 

i«l 


1. 


Proof : 

« 

We  denote  the  submatrix  of  Zq  formed  by  deleting  the 
P  (1  ^  p  <  k)  rows  ij^,  12 f  ip  and  columns 
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^1/  ^2'  *  * • '  by 


»  ^2  ^  * 


'  S'  S'  S 


9  •••  9 


0  <  <  ij  < 


.  <  i  <  k 

P  - 


0  <  ji  ^  12  ^ 


•  "  jp  i 


and  its  determinant  by 


x^,  X2'  •••»  ^p»  3i»  D2»  •••»  3p| 


Using  this  notation  the  determinant  of  £q  is  given  by 


So  I  *  ®kk  ^  f'0k'''’00®kk  1^0 


k-1 


0;k 


By  assumption 


I 


k;k 


k-1 


k-1 


■  ifi  i!o 


0;k 


1Eq'"'*|  and  the  appropriate  lower  order  minors  can  be  evalu¬ 
ated  in  the  same  manner  as  [EqI.  Continuing  in  this  fashion 
we  obtain 

k  k 

IEqI  -  (1  -  i  pJi)  n  o^j^. 

i«l  i»0 

Using  this  same  method  of  evaluation,  we  also  note  that 
any  (k+l-r) —  i  principal  minor  of  (a  minor 

formed  by  deleting  the  same  r  rows  and  columns)  can  be  of  two 


possible  forms. 
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Case  (a) 

I£  the  0^  row  and  column  are  deleted,  the  principal 
minor  is  given  by 

k 


i  ^  i^ 


Casa  (b) 

If  tha  0—  row  and  column  are  not  deleted,  the  submatrix 
is  of  the  same  form  as  Eg  and  the  corresponding  principal 
minor  is  given  by 


k 

(1  •  I  qL) 

i-1 

(i  ij^,  ijr 


n  0^^. 

i-1 


ij.) 


The  proof  can  be  completed  by  using  the  representation 
that  symmetric  matrices  are  n.n.d.  iff  all  principal  minors 
are  nonnegative  and  noting  the  expressions  for  |Eg|  and  the 
principal  minors  of  Eg. 


Lemma  4 . 4 

(4.33)  Yij  >  (i  <  j  <  k)  , 


when 


58 

(4.34)  +  pJjjj  +  pJj^j  <  1. 

Proof ; 

Let 


f 


'ij 


Y 


ik* 


Then  after  canceling  terms,  we  have 


(4.35) 


’Olj] 


^0(j]  “  Poik]  Po[k] 
^OCjlPoik)  P0(i]P0tjl 


"  P0[i]P0[k]  " 
^  ‘^0(k]P0(jl* 


We  note  that 


3f/3pOjij  -  +  2pJ^jpJjjj 

*  ^Poij]  "  PQIk]^ 

<  0. 

Hence  f  is  a  nonincreasing  function  of  Pq^^j*  To  show  that 
f  ^  0  we  Increase  Pqj^j  until  either 

(4.36a)  pJ^j  -  1  -  pJjjj  -  pJjj^j 

or 

(4.36b)  pJj^j  -  pJjjj  , 


whichever  occurs  first 
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Cai«  (>) 

Substituting  (4.36a)  into  (4.35)  and  denoting  this  ex 
pression  by  we  have,  after  canceling  terms 

*  ^Po(k)  “  PQIjJ^  ■  ^PQfk]  ‘  ^Otj]^ 

-  ^‘^OtkJ  “  P0(j]^  “  P0[j}^^0[k]  ■■  P0[j]^ 

>  0. 


Case  b 

Substituting  (4.36b)  into  (4.35)  and  denoting  thit;  ex 
pression  by  we  have 


(4.37) 


^b  “  P0[j]  PoijJ  “  P0[j]  ‘  P0[k]  P0(kj 
“  ^^OtjjPQlk)  P0lk]P0[j]' 


and 


3p5rjj  +  -  1 


^Pcijl^Otk]  Po[k] 


It  is  evident  that 


^^b/^Po[jl  - 


when  pQ[]^]  L  1/4*  From  (4.34)  and  (4.36b)  we  obtain 


or 


P0[jj  - 

^Olk]  -  ^ 


so  that 

3fb/^P0[jl  i  ^P0(j]  “  ^P0[j]' 

when  pQjj^j  i  1/4.  But  (4.34)  and  (4.36b)  imply  that  Po[j] 
3/8  and  hence 


Using  the  same  method  as  before,  we  increase  pQjjj  until 


(4.38b') 


or 


(4.38b")  Po(j]  ■  (1  " 
whichever  occurs  first. 


Case  b* 

Substituting  (4.38b')  into  (4.37)  and  denoting  this  ex 
pression  by  obtain  f^,  ■  0. 
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Case  b" 

Substituting  (4.38b”)  into  (4.37)  and  denoting  this  ex¬ 
pression  by  f^i, ,  we  obtain 

V  ■  <-  ’<’0(10  *  ’Potk)  *  Ooik)  - 

For  this  case  (4.38b")  also  implies 
-  PQDc]  - 

which  gives  us  fj^n  >  0  and  completes  the  proof. 

Lemma  4.5 

(4.33)  Yij  >  0  (i  <  j  <  k) , 
when 


^0(i]  ^  P0[j]  PoikJ  - 

Proof ; 

The  proof  follows  directly  from  Lemma  4.4  by  noting  that 
Yjj^  ^  1.  (Yjjj  is  the  covariance  between  two  random  variables 
each  having  unit  variance.)  Hence 

1  -  Kik  -  Yjk  +  Tij  >  1  - 

>  0, 

and  the  lemma  follows  immediately. 
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Theorem  4.2 

(4.40)  ^2^ 


k-.l 

n 

i=l 


♦  (Ej^) 


Proof ; 

The  proof  follows  directly  froiu  Lemma  4.3  and  Slepian's 
inequality  (Lemma  B.2). 

We  use  this  theorem  to  find  a  lower  limit  for  the  PCS^ 
over  the  region  of  preference  for  a  correct  selection  and  thus 
obtain  a  conservative  approximation  to  the  asymptotic  sample 
size.  (This  approximation  is  conservative  in  that  it  will 
always  be  greater  than  the  true  asymptotic  sample  size.) 


(/n/2)  log  0*. 
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Proof ; 

This  theorem  follo'./s  from  Theorem  4.2  and  the  fact  that 

Yij,  >  0  (i  »  1,  2 . k-1). 

It  is  disconcerting  to  note,  however,  that 

k-l 

infimum  n  ^(e.)  >  {'l>(e*)} 
i»l  ^ 

(4.42)  >  6- 
I Q  n  •  n .  d . 

for  general  k. 

Setting  the  r.h.s.  of  (4.42)  equal  to  P*  we  obtain  a  con¬ 
servative  asymptotic  approximation  to  the  Scunple  size 

(4.43)  /n  =  24“^(P*^^^^"^^  )/log  O*. 

If  the  experimenter  has  a  priori  knowledge  that 

(4.44)  Polk)  -  P*^' 

where  is  a  preassigned  constant  satisfying 

(4.45)  <  {(k-l)0*  -  (k-2)  }/{(k-l)e*  +  1}, 


a  stronger  result  can  be  obtained. 
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and  is  given  in  Theorem  4.3. 

Proof ; 

The  proof  follows  directly  from  Lemma  4.2  by  noting  that 
Pq[j^j  *  and  »  9*  (i  =  1,  2,  ...,  k-1)  yield  the  r.h.s. 
of  (4.46)  and  1q  is  n.n.d. 

Setting  the  r.h.s.  of  (4.46)  equal  to  P* ,  we  obtain  the 
following  approximation  to  the  asymptotic  sample  size  using 
the  a  priori  information  (4.44): 

/n  -  24  ^^)/(/l  -  Y*  log  0*). 


(4.47) 
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4.2  (k  >  2,  t  >  1) 

In  the  remaining  sections  of  Chapter  4  we  consider  the 
problem  of  selecting  a  set  of  predictor  variates  of  fixed  but 
arbitrary  number  t  (1  <  t  <  k).  We  have  not  been  able  to  show 
that  the  conditions  required  for  the  Slepian  inequality  (Lemma 
B.2)  are  satisfied  (although  we  conjecture  that  they  do  hold). 
Hence  we  use  the  Bonferroni  inequality  (Lemma  B.l)  to  obtain  a 
lower  bound  on  the  PCS^  and  proceed  in  a  manner  similar  to 
that  of  Section  4.1.4. 

4.2.1  Notation 

Using  the  a  notation  of  Chapter  3,  we  define  the  k-vector 
a*  in  terms  of  the  two  k-vectors  a  and  a"  by  giving  the  i“ 
component  of  a*  as 

(4.48)  a*  =  max(a.  ,  a"). 

1  X  i 

We  let  denote  the  ordered  p-tuple  (t  ^  p  <  min(k,  2t) ) 

th  th 

whose  i —  component  is  given  by  the  position  number  of  the  i — 

nonzero  component  of  the  vector  a.  For  exeunple,  if  (k  =  5, 

t  ■  3) ,  a  *  (1,  0,  0,  1,  1),  and  a"  =  (!»  0,  1,  0,  1),  then 

a*  -  (1,  0,  1,  1,  1),  u^  -  (1,  4,  5),  u^„  =  (1,  3,  5),  and 

u^.  -  (1,  3,  4,  5). 
a" 

Since  the  predictor  variates  are  uncorrelated,  we  then 


have 
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"S.a  =  Ooj' 

where  means  tliat  j  takes  on  the  values  of  the  components 

of  u  .  Hence  (3.3)  becomes 
a 

^O.a  PQj^' 

-*  a 

The  population  multiple  correlation  coefficient  between  and 
the  set  consisting  of  those  predictor  variates  associated  with 
Rq  ^  or  Rq  is  denoted  by  Rq  and  can  be  written  as 


(4. 

51) 

^O.a*  “ 

z 

Poj- 

For 

the 

previous 

example 

we 

then 

R^ 

0 .  a 

POI  ^  P 

04  ^05 

(4. 

52) 

0  •  a 

^Sl  ^ 

^03 

^  P05 

^O.a* 

Pqi 

P03 

P04 

We  denote  the  ranked  values  of  the  „  and  ,  and 

U  •  Cl  V  •  Q 

if  j 

their  estimators  in  the  usual  manner.  And  we  let  E  de- 

m 

note  the  summation  of  the  over  ';he  values  of  the  subscript 
of  the  variates  contained  in  the  union  of  the  two  sets  of 
predictor  variates  associated  with  Rq  and  R^  Thus, 

if  (k  »  5,  t  ■  3)  and  the  variates  associated  with  Rq 


and 
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ITq  [j]  ^1'  ^2'  ^3'  ^4'  ^•®P®ctlvely ,  then 


*^0.  [j] 

are 

ir  j 

(4.53) 

Z 

m 

^Om  ^  ►'Om 
m*l 


E  p; 


4.2.2  ProbcUaility  of  correct  selection 

Using  the  notation  of  Section  4.2.1,  we  define 


(4.54) 


(4.56) 


and 


1  -  Z 


m 


Om 


(1  - 


^0.  [il^  ■  ^0. [j]^ 


(i^j ;i,j=l,2,.  . . ,U“1)  , 


(4.55)  Yi  » 


2/1  -  A^y 


(1=1,2,. . . ,U-1)  , 


/n  log((l  -  (1 

2/r^“ 


"o-iuj” 


(1=1,2,. ..,U-1) , 


(i=l,2,. . . ,U-1) 

Then  (3.24)  can  be  written  as 

(4.57)  PCS  =  P{w^  <  (i  =  1,  2,  ...,  U-1)} 

Using  Corollary  A. 8,  we  obtain  the  following  asymptotic 
(N  -*■  <*>)  approximation  for  (4.57). 


68 


(4.58)  PCCa  =  ^2'  •••'  ^U-l^  ' 

where  ®  standard  (U-1) -variate  normal  distribution 

function  having  zero  means,  unit  variances  and  off-diagonal 
covariances  given  by 

(1  -  X  -  X  +  X.  .) 

(4.59)  )).  »  - -  . U-1) 

^  2/(1  -  l.„)(l  -  Xjy) 

Using  this  expression  for  the  PCS_,  we  obtain  the  follow- 
ing  theorem. 

Theorem  4.5 

infimum  PCS^  >  1  -  (U-l)*(-^*), 

Vl,U  ^ 

£q  n.n.d. 

where 

®  (/n/2)  log  9* 

and 

®U-1,U  *  ■  ^0.  lU-1)  "  ^0.(UJ^* 

Proof : 

The  proof  follows  from  the  fact  that  X^y  ^  0  (i  « 

1,  2,  . . .  ,  U-1) . 
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Setting  the  r.h.s.  of  (4.60)  equal  to  P*,  we  obtain  an 
approximation  to  the  asymptotic  sample  size 

(4.61)  /n  =  -2*-*((l  -  P*)/(U-1)  )/log  6*. 

Although  there  is  no  general  set  of  parameter  configurations 
where  the  lower  bound  attains  this  lower  limit  for  all  t  and 
we  do  not  wish  to  analyze  each  t  separately,  we  do  note  that 
for  the  case  t  =  )c-l  this  lower  limit  is  attained.  For  this 
case  we  have  U  =  )c  and  using  Corollary  A.  8,  we  obtain  the 
following  theorem. 


Theorem  4.6 


k-1 

infimum  1  -  I  =  1  -  (k  -  1)<K-C*), 

i«l 

\,k-i  i  «* 

£  Q  n .  n .  d . 

where 

-  (/n/2)  log  e* 

and 

®k-l,k  “  "  ^0.[k-l]^/^^  "  ^O.lk]^- 

t^roof : 


Since  ^0  (i  ■  1,  2,  ..., 


k-1)  implies  that  Theorem 
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4.6  holds  whsn  th«  ■  in  (4.62)  is  rsplaead  by  >,  wt  n««d  only 

exhibit  a  pareimeter  configuration  for  which  the  lower  limit  is 
attained.  Since  t  »  k-1,  we  have 


i.j 

I 


m 


k 

Z 

i=l 


P 


0[i]* 


Hence  for  the  parameter  configuration 

pQjlj  »  1/(1  +  (k-l)/e*) 

(4.63) 

P0(il  “  Po[l]  (i-2,3,...,k-l) , 

we  have  *0  (i*l,  2,  ...»  k-1)  and 

=  (/n/2)  log  0*  (i=l,2,...,k-l)  , 

which  completes  the  proof  of  the  theorem. 


5 .  Predictor  variates  with  unknown  correlationa 

In  this  chapter  we  consider  the  case  where  the  correla¬ 
tion  structure  of  the  predictor  variates  of  the  (k+1) -variate 
normal  distribution  is  also  unknown.  The  Z  n.n.d.  requirement 
does  not  reduce  to  a  simple  set  of  inequalities  for  arbitrary 
k  as  did  the  Zq  n.n.d.  requirement,  although  the  additional 
pareuneters  (i  ^  j;  i,j  =  1,  2,  . . . ,  k)  make  the  Z  n.n.d. 

requirement  less  stringent. 

We  consider  only  the  case  of  t  =  1.  For  t  >  1,  the  cor¬ 
relation  between  the  sample  conditional  variances  becomes 
extremely  messy.  Throughout  this  chapter,  we  use  the  notation 
given  in  Section  4.1.1. 
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(5.2) 


^  log (s' 


o^iy  . k-1), 


2/1  -  u) 


ik 


/n  log((l  -  P^,.,)/(l  - 

(5.3)  — (i-=l,2,. . .  ,k-l)  , 


2/1  -  0) 


ik 


and 


"i  ■  H 


(l.l,2,...,k-l). 


Then  the  PCS  (3.24)  can  be  written  as 


(5.4)  PCS  *  P{w^  <  (i  -  1,  2,  ...,  k>l)}. 

Using  Corollary  A. 4  and  (5.4),  we  can  write  the  PCS. 

A 

(3.25)  as 

(5.5)  P^S^  *  ^k“l^"^l^  ^2^  •••»  "^k”!^^ 

v/here  is  the  (k-1) -variate  standard  normal  distribution 

function  with  zero  means,  unit  variances,  and  off-diagonal 
covariances  given  by 


(1  -  U)  -  U)  +  0)  ) 

(5.6)  u)!  ■  - - -  ^  (if<j;i,j-l,2,...,k-l)  . 

^•'(l  -  “iK)  (1  -  V 

5.2  k  -  2 

first  consider  the  case  (k  -  2,  t  -  1),  where  the  PCS. 

A 


to 


(5.5)  reduces 
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(5.7)  PCS^  =  ^(T^) . 

Preliminary  to  finding  the  infimum  of  this  expression  we 
prove  a  lemma  which  gives  a  simple  representation  of  the 
Z  n.n.d.  requirement  for  k  =  2. 

Lemr'a  5 -I 

For  k  =  2  the  requirement  that  E  be  n.n.d.  is  equivalent 
to  the  following  inequalities: 

Oii  >  0  (i=0,l,2) 

(5.8) 

^0(1)  ^0(2]  ^11112)  ■  ^P0(l]^0[2)  ^[1]  [2]  - 

Proof : 

As  before,  this  equivalence  can  be  established  by  using 
the  representation  that  a  symmetric  matrix  is  n.n.d.  iff  all 
principal  minora  are  nonnegative. 

Theorem  5.1 

infimum  PCS^  “  <I'(t*), 

a 

(5.9)  02^2  - 

E  n.n.d. 

where 


T  *  s 


(/n/2)  log  0* 
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and 


=  (1  - 


^0[2]^ 


Proof ; 

Since  theorem  holds  when  the  =  of  (5.9)  is 

replaced  by  Hence  we  need  only  exhibit  a  parameter  con¬ 
figuration  satisfying  the  two  restrictions  of  the  theorem  and 
for  which  “  0*  0^®  such  parameter  configuration  is  given 

in  (4.27)  . 

The  asymptotic  sample  size  is  the  same  as  given  by 
(4.28).  It  is  also  interesting  to  note  that  for  any  given  0*, 
the  infimum  can  be  attained  for  more  than  one  parameter  con¬ 
figuration,  whereas  in  Theorem  4.1,  for  any  given  0*,  the 
infim\im  was  attained  for  only  one  parameter  configuration. 


5.3  k  >  2 

We  have  been  unable  to  show  that  the  conditions  (Lemma 
B.2)  required  for  the  use  of  the  Slepian  inequality  hold  when 
the  predictor  variates  are  no  longer  assumed  to  be  uncorre¬ 
lated.  Hence  we  use  the  Bonferroni  inequality  to  obtain  a 
lower  bound  on  the  PCS^.  Using  (5.5)  and  (B.l)  we  have 

k~l 

(5.10)  PCS^  >  1  -  Z  ♦(-T. ) . 

o  •  ,  X 

.  1=1 

In  the  following  theorem  we  find  the  infimum  of  this 
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Poiij  -  *^1/(1  +  e*) 

(i=l,2,... ,k-l) 

'’olkl  -  +  «*> 

; 

(5.12) 

^lil  (j]  "  ^ 

(i^j;i,j=l,2, . . . / k— 1 ) 

^[i] (k]  “  ° 

(i=l,2,...,k-l). 

The  restriction  i  9* 

cind  ~  ^  •  m  ^ 

k-1)  follow  directly-  To  prove  the  existence  of  a  nonnegative 
definite  covariance  matrix  corresponding  to  this  configura¬ 
tion,  we  assume  without  loss  of  generality  that  the  predictor 
variates  are  ordered  according  to  increasing  so  that 

is  the  «/ariate  associated  with 

Wo  use  the  result  (e.g.,  Anderson  (2])  that  a  k  +  1  by 
k  +  1  symmetric  matrix  I  is  n.n.d.  if  there  exists  a  k  +  1  by 
r  matrix  A,  where  r  <  k  +  1  is  the  rank  of  Z,  such  that 
T  AlA’  and  T  is  positive  definite. 

For  our  problem,  r  =  2  and  we  have 


(5.13)  A 


1  0  0  0  . 

0  10  0. 


0 

0  , 


so  that 
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(5.14) 


T 


1 

/l/(l+0*) 


/i/(i+e*) 

1 


Since  6*  >  1,  we  have  immediately  that  the  principal 
minors  of  T  are  positive.  Hence  T  is  positive  definite »  and 
the  proof  is  complete. 

Setting  the  r.h.s.  of  (5.11)  equal  to  P*,  we  obtain  an 
approximation  to  the  asymptotic  sample  size 

(5.15)  /n  -  -2*“M(1  -  P*)/(k  -  l))/log  0*. 


f 
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6 .  Directions  for  future  research 

Several  interesting  problems  are  suggested  by  the  results 
of  this  thesis.  We  pose  some  of  these  problems  in  this 
section . 

Throughout  Part  XI  we  used  the  Bonferroni  and  Slepian 
inequalities  to  find  lower  bounds  on  the  PCS^^  and  then  from 
these  lower  bounds  we  obtained  approximations  to  the  Scimple 
size.  It  would  be  interesting  to  compare  these  lower  bounds 
on  the  PCS  with  approximations  of  the  PCS  (obtained  by 
numerical  integration)  to  get  some  idea  of  the  "efficiency"  of 
the  inequalities  in  the  problems  we  have  considered.  Of  par¬ 
ticular  interest  would  be  a  comparison  of  the  infimum  (ob¬ 
tained  by  nujnoric£il  search  techniques)  of  this  approximation 
of  the  PCS^  over  the  region  of  preference  for  a  correct  selec¬ 
tion  with  the  analytical  results  obtained  from  the 
inequalities . 

A  simpler  problem  which  is  also  of  interest  is  a  compari¬ 
son  of  the  lower  limit  given  in  Theorem  4.3  with  the  infimum 
(obtained  by  nvimerical  search  techniques)  of  the  lower  bound 
of  Theorem  4.2. 

All  of  the  results  in  Part  II  have  been  obtained  using 
asymptotic  distribution  theory.  It  would  be  interesting  to 
compare  the  PCS^  obtc^ined  from  this  theory  with  Monte  Carlo 
estimates  of  the  PCS.  Of  particular  interest  is  the 
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comparison  of  these  estimates  of  the  PCS  with  the  PCS^  as  a 
function  N  and  0*,  when  the  parameters  are  in  the  LFC. 

The  sensitivity  of  the  results  of  this  thesis  to  the 
assumption  that  the  distribution  of  (Xq,  Xj^)  is 

multivariate  normal  is  another  problem  that  could  be  studied 
using  Monte  Carlo  techniques. 

The  results  of  Sections  4.2  and  5.3  could  be  strengthened 
if  the  Slepien  inequality  could  be  used  in  place  of  the 
Bonferroni  inequality.  Although  the  conditions  required  for 
the  Slepian  inequality  (Appendix  B)  appear  to  be  satisfied  for 
these  more  general  cases,  we  have  been  unable  to  prove  this 
analytically.  The  method  employed  in  Section  4.4.4  to  obtain 
this  result  appears  to  be  too  complicated  and  messy  for  these 
more  general  cases  and  hence  a  new  approach  is  needed. 

An  obvious  but  seemingly  difficult  problem  is  the  exten¬ 
sion  of  the  results  of  Chapter  5  for  t  =  1  to  arbitrary  t  or 
at  least  to  t  =  2.  The  major  difficulty  here  is  the  compli¬ 
cated  nature  of  covariances  of  the  sample  conditional 
variances . 

In  all  of  the  problem  formulations  we  have  considered  in 
this  thesis,  the  value  of  t  was  fixed  prior  to  experimenta¬ 
tion.  In  some  situations  it  might  be  more  reasonable  to 
allow  the  value  of  t  to  be  determined  on  the  basis  of  the 
values  of  the  observations.  There  are  a  niomber  of  ways  in 
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which  this  new  problem  could  be  formulated.  However,  it  ap¬ 
pears  that  the  same  difficulties  encountered  in  the  formula¬ 
tions  considered  in  this  thesis  would  also  be  encountered  in 
these  new  formulations. 


Appendix  A 

Asymptotic  joint  distributions  of  che 


sample  conditional  variances 

In  this  appendix,  we  find  the  asymptotic  joint  distribu¬ 
tions  of  the  Bcimple  statistics  which  are  used  in  the  decision 
procedures  of  Part  II.  All  of  these  asymptotic  distributions 
are  multivariate  normal  and  hence  are  defined  by  their  ex¬ 
pected  values,  variances,  and  covariances.  We  will  denote 
these  moments  of  the  asymptotic  distribution  by  E^,  Var,,  and 

a  a 

Cov  ,  respectively.  (For  each  of  the  cases  we  consider,  these 
quantities  are  equivalent  to  the  corresponding  asymptotic 
moments  of  the  exact  joint  distribution,  although  this  is,  of 
course,  not  true  in  general.) 

These  asymptotic  joint  distributions  are  derived  from  the 
following  two  theorems.  Theorem  A.l  is  an  immediate  result  of 
Anderson's  f2J  Theorem  4.2.5  and  Theorem  A. 2  is  a  generaliza¬ 
tion  of  his  Theorem  4.2.6  as  given  by  Rao  (44),  Section  6a. 2 
result  ill. 

We  let  n  =  N-t-1  and  define 


(A.l)  c^j  *  '^ij/"'^®ii®jj' 


where  v . .  is  the  ij—  element  of  the  cross  product  matrix 
given  by  (3.10). 
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Theorem  A.l 

The  asymptotic  ( (k+2)  (k+1) /2) -variate  normal  distribution 
of  /n  (i  <  jj  i»j  *  if  •••/  is  determined  by 


Ea(/ir  Cij) 


9 


Vara(/n  -  1  +  p?j. 


and 


CoVa(/Jr  /n 


HsPjr' 


Theorem  A«  2 

If  /n  u  has  a  p-variate  asymptotic  normal  distribution 
determined  by 

Ea(/n  u.)  =  b.  , 

Vara(/n  u^)  =  a^^^, 

Cov  (/^u.,  /nu.)  -a.., 
a'  i.'  j  ij' 

and  w  =  f (u)  is  a  vector-valued  function  of  the  p-vector  u 
such  that  each  element  of  w  is  totally  differentiable,  then 
w  has  an  asymptotic  multivariate  normal  distribution 
determined  by 
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Ea(*^  w^)  =  fj^(b)  , 
Var^(/n  w . )  =*  h.  Ah!, 

ci  X  ““X  “X 

and 


Cov  ( /n  w .  ,  /K  w . )  =  h .  Ah', 
a  1  D  ~D 

where  is  the  i~  component  of  f ,  A  is  the  p  by  p  covariance 
matrix  of  u,  and  the  h^  are  p-vectors  with  elements  given  by 

^ij  *  lu=b  ( j“l » 2 , . . .  ,p)  . 


A.  1  t  *■  1 

In  this  section  we  find  the  asymptotic  joint  distribution 
of  the 

(A. 2)  yl  -  ^(log(s>_,^,)  - 

(i=l, 2, . . . ,k-l) . 


To  obtain  this  result,  we  first  find  the  joint  asymptotic  dis¬ 
tribution  of  the  Sg  Using  the  notation  of  the  last  sec¬ 
tion,  we  have 


“O.i  ■ ‘’oo"=oo  - 


Applying  Theorem  A. 2  to  Theorem  A.l  and  (A. 3)  we  obtain 
the  asymptotic  distribution  of  the  /n  si  .  . 

U  •  X 
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Corollary  A.l 

The  asymptotic  k-variate  normal  distribution  of  the 
/n  s^  .  is  determined  by 

=  “oo'l  • 

®o.i>  ”  -  Ooi’"' 

and 

=°''a<’^  ®0.i'  ” 

-  ^oi  ■  Oflj  *  OoiPojOij’’ 

Applying  Theorem  A. 2  to  this  result,  we  obtain  the  asymptotic 
joint  distribution  of  the  /n  log  s^  . . 

U  •  X 

Corollary  A. 2 

The  asymptotic  k-variate  normal  distribution  of  the 
/n  log  Sq  ^  is  determined  by 

E^(/n  log  -  log  . 

Vdt^(A\  log  Fq  =  2. 


and 


Cov^(/n  log  /n  log  sj  j)  = 


2(1  -  p;^.  -  .  +  p.,p..p.  .)^ 

Oi  0i  Oi  On  IT 


(1  -  P-) 


From  this  result  using  the  notation  of  Section  3.4  we  obtain 
the  corresponding  result  for  the  ^/n  log  s^  • 


Corollary  A. 3 

The  asymptotic  k>variate  normal  distribution  of  the 
/n  log  s^  is  determined  by 

E,(/S  log  .  log(0(,„(l  -  oj^,)), 


where 


’'»ra('^  log  s  =  -  2, 


Cov,(/n  log  ,  /n  log  -  2o.  . 


(i?^j)  » 


(1  -  p*  -  +  p  p  p  )  ^ 

- °011)  ^Ojj)  0(1)  oiji^iiui  , 

ij  (1  -  Poiii’d  -  Poij)' 


and  the  defined  in  Section  5.1. 

Using  this  result  we  again  apply  Theorem  A. 2  to  obtain 
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the  following  corollary  which  is  used  in  Section  5.1  to  ex 
press  the  PCS_  in  terms  of  a  standard  multivariate  norma] 

cl 

distribution  function. 


Corollary  A. 4 

The  asymptotic  (k-1) -variate  normal  distribution  of  the 


vn  xogvs 


0.  {1)^  0.  (k-i-H) 


2/1  - 


(i“l»2f . . . fk— 1) 


U) 


ik 


is  determined  by 


(Vi) 


log ( (1  -  p 


0[i] 


)/(l  - 


^ow>> 


2/1  - 


(u 


ik 


Vara(yi)  »  I, 


and 


CoVa(y^ ,  Yj)  =  (i/j) , 

where 

(1  -  U).,  -  u  ,,  +  U)  ) 

'■''l  -  “u'  ‘1  -  “ik'" 

For  the  case  of  uncorrelated  predictor  variates 

“  0  (i  /  j;  i,j  =  1,  2,  ...»  k>,  and  we  have  the 
following  corollary  which  is  used  in  Section  4.1.2  to  give  an 
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•xprttsslon  for  the  PCS^^  (4.20). 


Corollary  A. 4a 

The  asymptotic  ()c-l) -variate  distribution  of  the 


(A. 4) 


(l)/*^0.  (k-i+1)  ^ 


2/1  -  Y 


ik 


(i=l,2,. 


is  determined  by 


Ea<yi) 


Xog((i  -  p;i,,)/(i  - 


2/1  - 


^k 


Var^(yj,)  =  1, 


and 


Cov^(yi.  Yj) 


where 


ID 


(1  -  Y..  -  Y..  +  Y.  .) 
_ iiL-..  DK  ^3 

2/(1  -  Yi^)(l  -  Yjk) 


and 


Y 


ij 


(1 

IT" 


~  "0[i]  '  ^O(j)^ 
P0(il^  ■  Po[j] 


.  .,k-l) 


(i/j)  , 


(i/j) 


(i/j)  . 
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A. 2  t  >  1,  uncorrelated  predictor  variates 

In  this  section  we  find  the  asymptotic  joint  distribution 
of  the  saunple  conditional  variances,  each  of  which  is  based  on 
t  uncorrelated  predictor  variates.  Again  our  objective  is  to 
find  the  joint  distribution  of  the  y|  (A. 2),  and  we  proceed  in 
a  manner  similar  to  that  used  in  the  previous  section  by  first 
finding  the  asymptotic  joint  distribution  of  the  s^ 

The  following  corollary  is  obtained  using  Theorems  A.l 
and  A.  2,  after  writing  the  Sq  ^  in  terms  of  the  c^^j  in  a  man¬ 
ner  similar  to  that  of  (A.  .3).  The  expressions  for  the  Sq  ^ 
are,  of  course,  considercibly  more  complicated  than  (A. 3). 


Corollary  A. 5 

The  asymptotic  U-variate  normal  distribution  of  the 

/n  si  is  determined  by 
0 .  a 


E  (/n  s^  )  =  o  (1  -  ) 

a  O.a  00  O.a 


Var^(/n  s  =  =  2o=j(l  - 


and 


Cov^ {/n 

O 


’O.a' 


/n 


’O.a" 


)  = 


2^00^1  - 


'0.a‘ 


(a?^a") 


where  a* ,  u  ,  and  *  are  defined  in  Section  4.2.1. 
a  O.a" 

We  apply  Theorem  A.  2  to  this  result  to  obtain  the 


following ; 


I 


89 


Corollary  A.  6 

The  asymptotic  U-variate  normal  distribution  of  the 
/n  log  s3  is  determined  by 

U  •  Cl  * 


E^(^  log  s  =  _^)  =  log(0„„(l  - 
Var^(/K  log  =  2, 

and 


Cov^(/n  log  s*^^, 


u  .a 


2(1 


- 


(a/a") . 


From  this  result  using  the  notation  of  Section  4.2.1,  we 
obtain  the  corresponding  result  for  the  /n  log  Sq  . 


Corollary  A.  7 

The  asymptotic  U-variate  distribution  of  the  /n  log  Sq 
is  determined  by 

log  =  log(o„j(l  - 

Vara(^  log  =5.,^,)  =  2, 

and 

Cov^{^  log  /n  log  =  2X^.  (i^^j)  , 


90 


where 


(1  -  r 

m 


(1  -  -  RS.ij,) 


(i?^j) 


and  the  svammation  notation  is  defined  in  Section  4.2.1. 

Using  this  result,  we  agai  n  apply  Theorem  A.  2  to  obtain 
the  following  corollary,  which  is  used  in  Section  4.2.2  to  ex¬ 
press  the  PCS^  (4.58)  in  teirms  of  a  standard  multivariate 
normal  distribution  function. 


Corollary  A. 8 

The  asymptotic  (U-1) -variate  normal  distribution  of  the 


(A.  5) 


2/1  -  X 


( i®l, 2 , . . . ,U-1) 


ik 


is  determined  by 


E^Cyi) 


2/1  -  1^^ 


var^fyj)  =  1, 


and 


Cov  (y.  ,  y .)  =  X:  . 
a  •'i  ?  3  13 


f 


where 
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Appendix  B 

The  Bonferronl  and  the  Slepian  inequalities 


After  8tati..j  the  Bonferi.oni  and  the  Slepian  Inequalities 
as  lemmas ,  we  use  each  of  them  to  obtain  lower  bounds  on 
multivariate  normal  probabilities.  These  results  are  used  in 
Part  II  to  give  lower  bounds  on  the  expressions  for  the  PCS^. 
First  we  give  the  Bonferronl  inequality  (e.g.,  Feller 

[18] ,  p.  100)  . 

Lemma  B. 1 

Let  A^,  A2»  ..•>  Ap  denote  a  sequence  of  events  and  let 

A^  denote  the  con^lement  of  the  event  A^.  Then 

P  P 

P{  U  A.  }  <  E  P{A.} 
i-1  i-1 

and  hence 

P  P 

P{  n  A.  }  -  1  -  P{  U  AM 
i-1  i-1 

P 

>  1  -  E  {A!} 
i-1 

If  we  define  the  events 

-  tYi  <  (i-l,2,...,p) , 
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then  by  this  lemma,  we  have 
(B«l)  ^2*  •*•»  ^ 

where,  as  before,  denotes  a  standard  multivariate  noirmal 
distribution  function  having  zero  means  and  unit  variances. 
The  following  lemma  is  due  to  Slepian  [48]. 


P 

-  I  a  -  <t(a.))  , 
i=l 


Lemma  B . 2 

Let  and  denote  p-variate  normal  distribution  func- 
P  P 

tiona  with  zero  expectations  and  nonnegative  definite  covari¬ 
ance  matrices  given  by  and  respectively.  If 

“•ij  -  "^ij  (i/j;i,j=l,2,.  . .  ,p) 

and 


whenever 
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and 


In  the  following  lemma,  we  show  that  the  Slepian  in¬ 
equality  (when  its  assumptions  are  satisfied)  gives  a  better 
bound  than  does  the  Bonferroni  inequality. 


Lemma  B.3 

P  P 

(B.3)  n  4(a.)  >  1  -  E  (l-»(a.)) . 

i=l  i=l 

Proof ; 

This  result  follows  directly  from  the  Bonferroni  in¬ 
equality  by  letting  a^^,  a2»  ...»  ®  sequence  of  events 

and  denoting  the  probeibility  of  the  event  a^  by  "Ka^). 


Appendix  C 

A  numerical  comparison  of  the  Bonferroni 
and  the  Slepian  inequalities 

In  this  appendix  we  study  the  efficiency  of  the  Bonfer¬ 
roni  and  the  Slepian  inequalities  by  using  each  of  them  to 
approximate  the  sample  size  for  a  problem  in  which  the  exact 
Seimple  size  is  known.  The  problem  is  that  of  selecting  the 
univariate  normal  population  with  the  largest  population  mean, 
when  each  of  the  populations  has  a  common  known  variance  (o^). 
The  PCS  for  this  problem  cai  be  expressed  (Bechhofer  [5])  in 
terms  of  a  (k-1) -variate  r^tandard  normal  distribution  function 
with  zero  means,  unit  variances,  and  off-diagonal  covariances 
of  1/2. 

Using  the  Bonferroni  and  the  Slepian  inequalities  to  give 
lower  bounds  on  this  PCS  expression,  we  obtain  two  conserva¬ 
tive  approximations  to  the  sample  size  denoted  by  and  Ng, 
respectively,  where 

(C.l)  -  2(o/6*)$-M(1  -  P*)/(k  -  D), 

{C.2)  «=  2(o/6*)  ) 

and  {6*,  P*}  are  the  preassigned  constants. 

The  exact  sample  size  N  can  be  computed  by 
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(C.3)  /N  =  (o/6*)d(P*,  k)  , 

where  the  values  of  d  are  given  in  Table  1  of  Bechhofer  (5j. 

The  values  of  N^/N  and  Ng/N  are  given  in  Table  C.l  ..or 
k  =  3,  6,  9  and  P*  values  ranging  from  .5  to  .999. 

For  high  P*  (P*  ^  .99),  both  of  the  inequalities  provide 
good  approximations  to  the  exact  sample  size  (within  5%  for 
k  ^  9) ,  ana  the  Ng  is  only  slightly  larger  than  Ng. 

For  P*  <  .9,  neither  of  the  inequalities  provides  a  good 
approximation  to  the  sample  size  (Ng  and  Ng  are  both  at  least 
15%  greater  than  N  for  the  values  of  k  considered) ,  and  Nfi/N 
is  considerably  larger  than  Ng/N. 

For  .9  ^  P*  <  .99,  the  Slepian  inequality  provides  a 
reasonable  approximation  when  k  is  not  too  large,  but  the 
approximation  provided  by  the  Bonferroni  inequality  is  not  as 
good. 


Table  C. 1 


A  Comparison  of  Sample  Approximations  Derived  from 
the  Bonferroni  and  the  Slepian  Inequalities 
with  the  Exact  Requirements  for  the 
Problem  of  Ranking  Normal  Means 


k  = 

3 

6 

9 

P* 

Nfl/N 

Ng/N 

Ng/N 

Ng/N 

Ng/N 

Ng/N 

.5 

2.93529 

1.91768 

2. 47320 

1.91914 

2.35754 

1.92193 

.6 

1.80715 

1.45038 

1.85936 

1.58701 

1.85087 

1.62104 

.7 

1.40178 

1.25527 

1.51766 

1.38337 

1.54029 

1.42128 

.8 

1.20335 

1.14557 

1.30124 

1.24123 

1.32900 

1.27371 

.9 

1.08838 

1.07172 

1. 14913 

1.12988 

1.17077 

1.15237 

.95 

1.04652 

1.04070 

1.08496 

1.07732 

1.10037 

1.09340 

.99 

1.01444 

1.01376 

1.02948 

1.02857 

1.03660 

1.03568 

.995 

1.00941 

1.00912 

1.01980 

1.01941 

1.02500 

1.02460 

.999 

1.00381 

1.00376 

1.00846 

1.00839 

1.01101 

1.01089 
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